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r—^ ABSTRACT 

o 

The final product of galaxy evolution through cosmic time is the population of galaxies in the Local Universe. These galaxies are also those 

• that can be studied in most detail, thus providing a stringent benchmark for our understanding of galaxy evolution. Through the huge success 
of spectroscopic single-fiber, statistical surveys of the Local Universe in the last decade, it has become clear, however, that an authoritative 

(2Lh observational description of galaxies will involve measuring their spatially resolved properties over their full optical extent (covering D25) for a 
statistically significant sample. We present here the Calar Alto Legacy Integral Field Area (CALIFA) survey, which has been designed to provide a 
first step in this direction. We summarize the survey goals and design, including sample selection and observational strategy. We also showcase the 
data taken during the first observing runs (June/July 2010) and outline the reduction pipeline, quality control schemes and general characteristics 
of the reduced data. 

This survey is obtaining spatially resolved spectroscopic information of a diameter selected sample of ~600 galaxies in the Local Universe 
(0.005< z <0.03). CALIFA has been designed to allow the building of two-dimensional maps of the following quantities: (a) stellar populations: 
ages and metallicities; (b) ionized gas: distribution, excitation mechanism and chemical abundances; and (c) kinematic properties: both from stellar 
^ and ionized gas components. CALIFA uses the PPAK Integral Field Unit (IFU), with a hexagonal field-of-view of ~1.3n', with a 100% covering 
(Sj factor by adopting a three -pointing dithering scheme. The optical wavelength range is covered from 3700 to 7000 A, using two overlapping setups 
\^ (V500 and V1200), with diff'erent resolutions: R~850 and R~1650, respectively. CALIFA is a legacy survey, intended for the community. The 
^\ reduced data will be released, once the quality has been guaranteed. 

The analyzed data fulfill the expectations of the original observing proposal, on the basis of a set of quality checks and exploratory analysis: (i) 

• the final datacubes reach a 3cr limiting surface brightness depth of ~ 23.0 mag/arcsec^ for the V500 grating data (~22.8 mag/arcsec^ for VI 200); 
, (ii) about ~70% of the covered field-of-view is above this 3cr limit; (iii) the data have a blue-to-red relative flux calibration within a few percent in 
, most of the wavelength range; (iv) the absolute flux calibration is accurate within ~8% with respect to SDSS; (v) the measured spectral resolution 
, is ~85 km s~^ for V1200 (~150 km s~^ for V500); (vi) the estimated accuracy of the wavelength calibration is ~5 km s~^ for the V1200 data 

\ [ (~10 km s~^ for the V500 data); (vii) the aperture matched CALIFA and SDSS spectra are qualitatively and quantitatively similar Finally, we 
^ show that we are able to carry out all measurements indicated above, recovering the properties of the stellar populations, the ionized gas and the 

• ^ kinematics of both components. The associated maps illustrate the spatial variation of these parameters across the field, reemphasizing the redshift 
dependence of single aperture spectroscopic measurements. We conclude from this first look at the data that CALIFA will be an important resource 
?-H for archaeological studies of galaxies in the Local Universe. 

Key words, techniques: spectroscopic - galaxies: abundances - stars: formation - galaxies: ISM - galaxies: stellar content 



1. Introduction 

Our understanding of the Universe and its con stituents come s 
from large surveys such as the 2dFGRS ( ,Folkes et al.||1999) , 



* Based on observations collected at the Centro Astronomico 
Hispano Aleman (CAHA) at Calar Alto, operated jointly by the Max- 
Planck-Institut fiir Astronomic and the Instituto de Astrofisica de 
Andalucia (CSIC). 



SPSS ([York et al.' '2Q00^), GEMS (|Rix et al.|[2004l), V VDS (|L| 
|Fevreetal.|2QQ4l )rand COSMOS ( |Scoville et al.|2007a| ), to name 
but a few. Such surveys have not only constrained the evolu- 
tion of global quantities such as the cosmic star formation rate, 
but also enabled us to link these with the properties of individ- 
ual galaxies - morphological types, stellar masses, metallicities, 
etc. Compared to previous approaches, the major advantages of 
this recent generation of surveys are: (1) the large number of ob- 
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jects sampled, allowing for meaningful statistical analysis to be 
performed on an unprecedented scale; (2) the possibility to con- 
struct large comparison/control samples for each subset of galax- 
ies; (3) a broad coverage of galaxy subtypes and environmental 
conditions, allowing for the derivation of universal conclusions; 
and (4) the homogeneity of the data acquisition, reduction and 
(in some cases) analysis. 

On the other hand, the cost of these surveys, in terms of tele- 
scope time, person-power, and time scales involved, is also un- 
precedented in astronomy. The user of such data products has 
not necessarily been involved in any step of designing or con- 
ducting the survey, but nevertheless takes advantage of the data 
by exploiting them according to her/his scientific interests. This 
new approach to observational astronomy is also changing our 
perception of the scientific rationale behind a new survey: while 
it is clear that certain planned scientific applications are key de- 
terminants to the design of the observations and 'drive' the sur- 
vey, the survey data should, at the same time, allow for a broad 
range of scientific exploitation. This aspect is now often called a 
survey's legacy value. 

Current technology generally leads to surveys either in the 
imaging or in the spectroscopic domain. While imaging surveys 
provide two-dimensional coverage, the information content of a 
photometric Spectral Energy Distribution (SED) is limited. This 
remains true for the new generation of multi-band photometric 
surveys such as CQ MBQ-17 (Wolf et al.,2QQ3j , ALHAMBRA 
dMoles et al.||20Q8|), the planned PAU p roject ( [Bemtez et al. 
2QQ9| ), COSMOS ( [Scoville et al.||2QQ7b| ) or the LUS surve}{5 
which are geared towards better precision in redshift, mean ages 
and stellar masses, but are nevertheless unable to measure in- 
dividual spectral lines and thus emission line ratios or internal 
radial velocity d iff'erences. Spect roscopic surveys such as SDSS 
or zCOSMOS ( |Lilly et al .'2007), on the other hand, do provide 
more detailed astrophysical information, but they are generally 
limited to one spectrum per galaxy. One thus misses all informa- 
tion on the radial distribution of galaxy properties and on all de- 
tails of the kinematics. Even when attempting to describe galax- 
ies by their integrated properties only, this state of aff'airs also 
leads to aperture losses that are difficult to control. For example, 
the 3'' diameter of the fiber used in the SDSS corresponds to dif- 
ferent physical scales at diff'erent redshifts, w ith limited possibil- 
ities to correct for these aperture eff'ects (e.g., |Kewley et al.|2QQ5[ 
Ellis et al.|2005| ). The most popular method is to compare results 
of a fit to the photometry of the whole galaxy with the photomet- 
ric fit corresponding to the area of the fiber only ( [Gomez et al. 
2Q03tlBrinchmann et al. 2004) ). Even more severely^ lZibetti et aL 
( [20091 ) have recently shown that spatially resolved stellar popu- 
lation analysis may lead to corrections of up to 40% for the stel- 
lar mass of a galaxy, when compared to integrated light studies. 

An observational technique combining the advantages of 
imaging and spectroscopy (albeit with usually quite small field 
of view) is Integral Field Spectroscopy (IFS). This technique al- 
lows us to study both the integrated and spatially resolved spec- 
troscopic properties of galaxies. IFS has the potential to pro- 
vide observational evidence to constrain many outstanding ques- 
tions of baryonic physics which are key to our understanding of 
galaxy evolution and, therefore, cosmology. Some of these are 
(1) the importance and consequences of merging, major and mi- 
nor; (2) internal dynamical processes, such as bars, spiral arms, 
stellar migration; (3) environmental eff'ects, such as tidal forces, 
stripping; (4) AGN feedback; (5) occurrence, spatial and tempo- 
ral extent and trigger of star formation. Spatially resolved spec- 



troscopic properties of a statistical sample of nearby galaxies is 
the dataset required to address these questions. 

However, so far, IFS has rarely been used in a 'survey mode' 
to investigate sizeable samples. Among the fe w exceptions ther e 
is, most notably, the SAURON survey (de Zeeu w et all2002| ), 
focused on the study of the central regions of 72 nearby early- 
type galaxies and bulges of spirals, and its extension AT LAS^^ 
(260 early-type galaxies at z < 0.01; C appellari et aL]|2011|). 
Others are the on-going PINGS project ( jRosales-Ortega et"aL 
2010!) at the CAHA 3.5m of a dozen of very nearby galaxies 



(-10 Mpc) and the currently o ngoing study of 70 (U)LIRGS at 
z < 0.26 using diff'erent IFUs ( Arrib as et al.|2008! ). Finally, the 
VIRUS-P instrument is currently used to carry out two small IFS 
surveys, namely VENGA (30 spiral galaxies, Blanc et al.|2010 ) 
and VIXENS[^(15 starbursts). All these datasets are clearly 
focused on specific science questions, adopting correspond- 
ingly optimized sample selection criteria and also observing 
strategies. For example, at the redshifts of the galaxies in the 
ATLAS^^ sample SAURON has a field of view of 30'' x 40 ^ 
, or < 7 X 9 kpc, thus does not cover the outer parts of these 
galaxies. 

On completion, CALIFA will be the largest and the most 
comprehensive wide-field IFU survey of galaxies carried out to 
date. It will thus provide an invaluable bridge between large 
single aperture surveys and more detailed studies of individual 
galaxies. With CALIFA we will fix observational properties of 
galaxies in the Local Universe, which will have a potential im- 
pact in the interpretation of observed properties at higher redshift 
(e.g. |Epinatetal.|2010 |). 

CALIFA is an ongoing survey, which has been granted with 
210 dark nights by the Calar Alto Executive Committee, span- 
ning through 6 semesters. The gathering of the data started in 
June/July 2010, and after a technical problem with the telescope, 
it was resumed in March 2011. This technical problem, fully 
repaired, has not affected the quality of the CALIFA data. 
About 20 galaxies are observed per month. As mentioned above, 
there will be consecutive data releases of the fully reduced 
datasets, once certain milestones/number of observed galaxies 
are reached. The first one is planned for late 2012, when a total 
number of 100 galaxies is completed (ie., observed, reduced and 
quality of the data tested). The current status of data acquisition 
can be obtained from the CALIFA webpag^ 

In this article we present the main characteristics of the sur- 
vey, starting from the design requirements in Section [2] Section 
[3]describes the sample selection criteria, and the main character- 
istics of the CALIFA mother sample. In Section]?] we describe 
the observing strategy, in particular the observations performed 
during the first CALIFA runs. The data reduction is described in 
detail in Section |5j and some of the first quality tests performed 
on the data are presented in Section |6] The exploratory analy- 
sis performed on these first datasets, obtained in 2010, to verify 
that we will be able to reach our science goals are presented in 
Section [7] A summary and conclusion of the results is presented 
in Section [8l 



2. Design drivers 

CALIFA has been designed to increase our knowledge of the 
baryonic physics of galaxy evolution. We intend to characterize 
observationally the local galaxy population with the following 
key points in mind: 



http://www.inaoep.mx/~gtc-lus/ 



^ http ://w w w. as .utexas . edu/~ alh/vixens . html 
^ http://www.caha.es/CALIFA/ 
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- Sample covering a substantial fraction of the galaxy lumi- 
nosity function. 

- Large enough sample to allow statistically significant con- 
clusions for all classes of galaxies represented in the survey. 

- Characterization of galaxies over their full spatial extent 
(covering in most cases the R25 radius) , i.e. avoiding aper- 
ture biases and harnessing the additional power of 2D reso- 
lution (gradients, sub- structures: bars, spiral arms...). 

- Measurement of gas ionization mechanisms: star formation, 
shocks, AGN. 

- Measurement of ionized gas oxygen and nitrogen abun- 
dances. 

- Measurement of stellar population properties: ages, mass-to- 
light ratios, metallicities, and (to a limited extent) abundance 
patterns. 

- Measurement of galaxy kinematics in gas and stars, i.e. ve- 
locity fields for all galaxies and velocity dispersions for the 
more massive ones. 

A careful assessment of these, partially competing, drivers 
and of the practical limits imposed by the instrument, the ob- 
servatory and the timescale has led to the following key charac- 
teristics of the survey: (i) Sample: -600 galaxies of any kind in 
the Local Universe, covering the full color-magnitude diagram 

down to Mfi 18 mag, selected from the SDSS to allow good 

a-priori characterization of the targets; (ii) Instrument: PPAK 
IFU of the Potsdam Multi-Aperture Spectrograph (PMAS) in- 
strument at the 3.5m telescope of CAHA, with one of the largest 
fields-of-view for this kind of instruments (>1 arcmin^); (iii) 
Grating setups: two overlapping setups, one in the red (3750- 
7000 A, R~850, for ionized gas measurements and stellar popu- 
lations) and one in the blue (3700-4700 A, R~1650, for detailed 
stellar populations and for stellar and gas kinematics); and (iii) 
Exposure times: 1800s in the blue and 900s in the red. 

3. The CALIFA galaxy sample 

We now briefly describe how the sample has been constructed. 
We limit this section to some fundamental considerations that 
are important for understanding the potential and limitations of 
the survey. In a later paper we will present a detailed character- 
ization of the sample and the distribution of galaxy properties, 
including comparisons to other studies and the galaxy popula- 
tion as a whole. 

The guiding principle for building the CALIFA galaxy sam- 
ple followed from the broad set of scientific aims outlined in 
the previous section, i.e. the sample should cover a wide range 
of important galaxy properties such as morphological types, lu- 
minosities, stellar masses, and colours. Further constraints were 
added due to technical and observational boundary conditions, 
such as observing all galaxies with the same spectroscopic setup 
to make scheduling as flexible as possible. As detailed below, a 
small set of selection criteria was employed to define a sample 
of 937 possible target galaxies, the so-called "CALIFA mother 
sample". From this mother sample, the galaxies are drawn as 
actual targets only according to their visibility, i.e. in a quasi- 
random fashion, so that the final catalog of CALIFA galaxies 
will be a slightly sparsely sampled (-2/3) subset of the mother 
sample. 

The CALIFA mother sample was initially selected from the 
SDSS DR7 photometric catalog (Abazajian et al. 2009), which 
ensures the availability of good quality multi-band photometry, 
and in many cases (but not all, cf. below) nuclear spectra. The 
CALIFA footprint is thus largely identical to that of the SDSS 
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Fig. 1. Distribution of the targets in the CALIFA mother sample 
in equatorial coordinates. The distribution over the two Galactic 
caps is obvious. Note that the north galactic plane part of the 
sample has been limited to ^ > -h7° while the south galactic 
plane part does not have that limitation, to counterbalance the 
much lower number of SDSS objects in the south galactic plane 
region. The orange square at the center of the figure indicates 
the location of the Coma cluster. 



imaging survey, modified only by an additional restriction of 
^ > 7° for galaxies in the North Galactic hemisphere to ensure 
good visibility from the observatory. Note that this criterion was 
not applied to objects at southern Galactic latitudes, in order to 
at least partly counterbalance the much smaller sky coverage of 
SDSS data in this region and improve on the observability of 
CALIFA targets throughout the year. Figure [T] shows the result- 
ing distribution of the CALIFA mother sample in the sky. 

The defining selection criteria of the CALIFA mother sam- 
ple comprise a combination of angular isophotal diameter selec- 
tion with lower and upper redshift limits (0.005 < z < 0.03). 
We further justify and discuss the consequences of these selec- 
tion criteria later in this section. Note that while no explicit cut 
in apparent magnitude is involved, the adopted criteria implic- 
itly ensure that only relatively bright galaxies enter the sample, 
with a roughly Gaussian distribution of the total r-band magni- 
tudes centred on a mean of r = 13.3 and with a Icr dispersion of 
0.8 mag. No CALIFA galaxy is fainter than r = 16. In Fig.|2]we 
show the distribution of the mother sample in the fundamental 
observed properties, total r-band magnitudes and redshifts. 

The choice of redshift range was driven by two requirements: 

(1) Objects within the luminosity range of interest should have 
apparent sizes well matched to the PPAK field-of-view (FoV). 

(2) All relevant emission lines in all galaxies should be cov- 
ered with a single spectral setup. As measure of the apparent 
galaxy size we adopted the "isoA_r" values (isophotal diame- 
ters D25 in the SDSS r-band) and selected only galaxies with 
45'' < D25 < 80''. We demonstrate below in Sect. [63] that this 
strategy indeed allows for a very efficient usage of the instru- 
ment, in the sense that a large fraction of the FoV provides use- 
ful data. The upper limit in the apparent diameters furthermore 
ensures that the contribution of light from the galaxy at the posi- 
tion of the sky fibers is negligible (typically > 27.5 mag arcsec^ 
in the r-band). We thus avoid objects larger than the FoV, which 
would require a mosaicing strategy plus separate sky exposures 
leading to a dramatic reduction of the observing efficiency in 
terms of the number of galaxies observable within a given time. 
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Not all galaxies in the SDSS photometric catalog obeying 
our isophotal diameter criterion have spectra - and therefore red- 
shifts - in the SDSS spectroscopic database, which is known 
to become increasingly incomplete for total magnitudes brighter 
that r ~ 14. In order to (as much as possible) overcome such an 
undesirable bias against bright galaxies, we supplemented the 
SDSS redshifts with information accessed through the SIMBAD 
database at CDS, which in turn is a compilation of a large variety 
of observations and redshift catalogs. Hence, there are SDSS- 
based spectra and redshifts for -60% of the CALIFA galax- 
ies, while for the remainder we have only the redshift informa- 
tion provided by SIMBAD. As the latter is also not 100% com- 
plete, there could be a few galaxies within the CALIFA foot- 
print without redshift measurements and, therefore, outside of 
the CALIFA mother sample. It is difficult to quantify this in- 
completeness, but it is unlikely to be more than a few percent, 
and probably much less. 

Our decision to construct a diameter- selected sample has 
several practical advantages, besides the obvious benefit of effi- 
ciently using the instrumental field-of-view. Another advantage 
has already been mentioned: for the adopted redshift range, the 
distribution of apparent galaxy magnitudes naturally favours rel- 
atively bright systems, and in fact there was no need to define an 
additional faint flux limit to the survey (see Fig.|2]). Furthermore, 
the range in absolute magnitudes is considerably broadened due 
to the factor of 6 between lowest and highest redshifts, so that the 
CALIFA sample encompasses an interval of > 7 mag in intrinsic 
luminosities. In fact, the low-redshift cutofl' was mainly intro- 
duced in order to limit the luminosity range and avoid swamping 
the sample with dwarf galaxies, which - given the limitation to 
600 galaxies in total - were considered to be outside the main 
scientific interest of the CALIFA project. 

Together with the wide range in luminosities comes a broad 
coverage of galaxy colours, and the CALIFA sample includes 
substantial numbers of galaxies in all populated areas of the 
colour-magnitude diagram, from the red sequence through the 
green valley to the blue cloud (which is of course eff'ectively 
truncated at its faint end due to the low-redshift limit). This 
broad colour distribution is illustrated in Fig. [3] where we com- 
pare the w - z vs. relation of galaxies in the CALIFA mother 
sample with the corresponding d istribution in the SDSS-NYU 
catalog (e.g. Blanton et al. 2005 1 ). While there are (and should 
be) diff'erences in the details of the distribution - which can be 
quantified, see below -, it is immediately clear that CALIFA 
at least qualitatively represents a wide range of galaxy types. 
Figure 3] also provides the number of CALIFA objects per bin in 
Mz and u-z (recall, however, that the observed sample size will 
be smaller by a factor ~ 2/3). These numbers show that there 
will be sufficient statistics in several bins to make robust state- 
ments about typical galaxy properties, for early-type as well as 
late-type galaxies. In fact, these were the numbers that drove the 
overall time request for CALIFA to enable a total sample size of 
600 galaxies. 

The broad representation of galaxy properties in the CALIFA 
sample is also reflected in the distribution of morphological 
types. While a thorough characterization of the sample in terms 
of morphology and structural properties will be the subject of 
a future paper, a qualitative impression can be obtained already 
from rather simple diagnostics. It has been demonstrated in the 
] |2001| ) that bulge- and disk-dominated sys- 



past (Strateva et al. 



tems can be reasonably well distinguished by their concentration 
indices, defined as the ratio C of the tqq and rso Petrosian radii 
provided by the SDSS photometric catalog. Typically, a value of 
C ^ 2.8 requires the presence of a substantial bulge, whereas 
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Fig. 2. Apparent r-band magnitude and redshift distribution of 
the CALIFA mother sample. 



C ^ 2.3 is indicative of an exponential disk. In Fig.|3]we coded 
the symbols into three groups, including a class of transition or 
uncertain objects with 2.3 < C < 2.8. Their diff'erent distribu- 
tions in the colour-magnitude diagram is immediately apparent. 

Clearly, the majority of CALIFA galaxies (~ 2/3) have sub- 
stantial disk components, including irregulars and interacting 
galaxies, and are more or less actively forming stars. The fi- 
nal sample of ~ 400 of such galaxies will clearly exceed any 
previous IFU study by a large factor. Interestingly, the ominous 
"green valley" intermediate to star- forming and passive galaxies 
is well covered by the sample. 

On the other hand, CALIFA will also provide IFU data for 
some 200 bulge-dominated, morphologically early-type galax- 
ies, most of which - as expected - cluster very strongly 
along the red sequence . While the successful ATLAS^^ project 
(Cappella ri et al.|2011 ) has already observed an even somewhat 
larger number of early-type galaxies, CALIFA will complement 
the insights from ATLAS due to its much larger spectral cov- 
erage (~ 6x) and FoV, which will allow the study of the outer 
regions of early-type galaxies. 

In terms of environment, the CALIFA sample is clearly dom- 
inated by field galaxies. It will eff'ectively include galaxy popu- 
lations in groups, but much denser environments will be poorly 
sampled. A rough estimate of the total number of galaxy clu sters 
i n the sample can be obtained from the counts by Baillard et al.| 
leading to a maximum of 10 clusters that will, at least 
ly, be covered by CALIFA. Fortunately, the Coma cluster 



poll 



partia 

at z - 0.023 is fully covered by the CALIFA footprint and red- 
shift range (cf. Fig. [T]). Therefore, there will be some (limited) 
ability to study the environmental dependence of galaxy proper- 
ties. 

Any sample of galaxies faces the question about its ability 
to represent, in a statistically well-defined way, the properties 
of the galaxy population as a whole. In some cases, e.g. in the 
ATLAS^^ project, it has been possible to construct and observe 
strictly volume-limited samples for which this relation can be di- 
rectly made, but this is not a viable approach for a more generic 
survey covering a wide range of luminosities. Many surveys 
are simply flux-limited (such as the SDSS main galaxy sam- 
ple), possibly supplemented by an additional volume limit im- 
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Fig. 3. Distribution of the CALIFA mother sample in the u-z vs. 
Mz colour-magnitude diagram. The overplotted numbers indi- 
cate the number of galaxies in bins of 1 mag in Mr and 0.75 mag 
in w - z. Different colours and symbols represent a classification 
into bulge- and disk-dominated galaxies as well as intermediate 
cases, as suggested by the concentration index C. For compar- 
ison, the contours delineate the number density distribution of 
galaxies in the SDSS-NYU catalogue (e.g., Blanton et aL 2005). 



posed by redshift cuts, and statistical relations are then obtained 
by applying volume corrections to the apparent trends. One of 
the advantages of working with a diameter-limited sample as 
in CALIFA is the fact that volume corrections ca n then be ap- 
plied just as easily as for flux-limited samples (see, |Davies et al. 
|1990t |de Jong & van der Kruit|1995] , for extensive discussions 
of diameter-limited samples and volume correction). We will al- 
ways use such corrections for CALIFA when deriving statistical 
trends within the galaxy population. 

Of course, our ability to perform volume corrections does 
not imply that the CALIFA sample is free from selection bi- 
ases. Such biases may even occur directly as a consequence of 
the selection procedure. For example, the low-redshift limit of 
z > 0.005 causes low-luminosity galaxies to be missing from the 
CALIFA sample, with incompleteness setting in roughly around 
Mr ~ -20 and becoming severe for ^ -18. This type of 
bias is unavoidable when applying explicit redshift cuts. The 
degree of incompleteness can then be estimated by comparison 
with samples selected by other criteria. Nevertheless, there may 
also be more subtle biases afl'ecting our sample that we are cur- 
rently not aware of, and we will investigate this issue further. We 
also continuously check that the selection of CALIFA galaxies 
for observation - supposed to be essentially random, since it is 
based only on visibility during a given night - does not introduce 
spurious trends incompatible with the full sample. 

As already mentioned, a more detailed characterization of 
the CALIFA mother sample will be presented in a separate ar- 
ticle that is currently being prepared. That paper will provide a 
detailed analysis of the distribution of morphological and multi- 
colour properties, estimates of the survey selection function for 
several parameters, and in particular a compilation of archival 
multi- wavelength information. 



4. Observing strategy 

The observations of the CALIFA survey ofl&cially started in July 
2010, being performed at the 3.5m telescope of the Calar Alto 
observa tory with the Po tsdam Multi Apertur e Spectrograph, 
PMAS ([Roth et aL||2QQ5] ) in the PPAK mode ( [Verheijen et~ 
[20041 IKelz et aL||2QQ6] K The PPAK fiber bundle consists of 
382 fibers of 2.7 arcsec diameter each (see fig. 5 in |Kelz et al. ' 
2006). Of these 382 fibers, 331 (the science fibers) are concen- 
trated in a single hexagonal bundle covering a field-of-view of 
74''x64'', with a filling factor of -60%. The sky background 
is sampled by 36 additional fibers, distributed in 6 bundles of 6 
fibers each, along a circle ~ 72 arcsec from the center of the in- 
strument FoV. The sky-fibers are distributed among the science 
fibers within the pseudo-slit in order to have a good characteriza- 
tion of the sky, sampled with a similar distortion than the science 
fibers; the remaining 15 fibers are used for calibration purposes. 
Cross-talk between the adjacent spectra in the detector is esti- 
mated to be less than 5% when using a pure aperture extraction 
( Sanchez|2006a| ). Adjacent spectra on the CC D may be mapped 
to very diff'erent locations in the spatial plane ( Kelz et al.|2006 ). 
However, it introduces an incoherent contamination, not impor- 
tant for the present study, that is minimized by adopting a more 
refined extraction procedure during the reduction process. 

PMAS wa s upgraded with an E2V CCD231 4Kx4K in 
October 2009 d Roth et al.|2010 |), which has increased nominally 
the wavelength range covered by a certain instrumental setup by 
a facto r two, with respect to the values reported by |Roth et aL] 
(2005). However, this nominal increase is not fully met at the 
four corners of the CCD. As a result of a trade-off* study for 
the PMAS fiber spectrograph, the optical system was deliber- 
ately designed to tolerate some vignetti ng at the edges o f the 
full detector field-of-view, as reported by |Roth et al.| ( [1997) ), and 
Roth et al7| ( 1998 ). Figure |4]illustrates the eff'ect of vignetting for 



the new 4Kx4K CCD, which was not noticeable in the previous 
2Kx4K detector configuration. The left panel shows a map of 
the diff'erential fiber-to-fiber transmission for all the 331 fibers in 
the hexagonal fiber-bundle. The eff'ect of the vignetting is clearly 
identified as a drop in the transmission at the four edges of the 
map. 70% of the spectra are free from vignetting . However, 
30% of the spectra, namely those close to the edge of the de- 
tector, do suff'er some loss of efficiency that gradually increases 
towards the comers of the CCD. The loss of throughput amounts 
to up to 30% in the best case and 50% in the worst case, for one 
half of those spectra, but more than 50% for the other half. It 
can be seen in Fig. |4] (left panel), that where vignetting sets in, 
no more than a quarter of the full spectral range is aff'ected. In 
summary the vignetting severely aff'ects less than a 15% of the 
ffbers, in less than a 25% of their wavelength range. 

The right panel illustrates the spatial distribution of the vi- 
gnetting. The mapping of the PPAK ffbers from the telescope 
focal plane to the CCD implies that the areas aff'ected by the vi- 
gnetting correspond to an annular ring at about ~15'' from the 
center of the FoV. In addition, we report the existence of two 
broken ffbers with a lower transmission than the average (ffbers 
65 and 127), not reported before. 

A dithering scheme with three pointings has been adopted 
in order to cover the complete FoV of the central bundle and 
to increase the spatial resolution of the data. T his scheme was 
already used in previous studies using PPAK ([Sanchez et al. 



2007bt [Castillo-M orales et "aLl[20TQl [Perez -Gallego et al.|2010| 



Rosales-Ortega et al. 2010). In most of these studies the off'sets 



in RA and DEC of the diff'erent pointings, with respect to the 
nominal coordinates of the targets, were: 0'',0'' ; -hl.56'',-h0.78''. 
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Table 1. Summary of the objects observed in the first CALIFA runs 



Name^ 


RA (J2000)' 


DEC (J2000)' 


Redshift^ 




u-g^ 


M, 


Morph.^ 


NGC 5947 


15:30:36.00 


+42:43:01.00 


0.01965 


14.20 


-20.41 


1.57 


SBbc 


UGC 09892 


15:32:51.94 


+41:11:29.27 


0.01893 


14.83 


-19.70 


1.66 


Sb 


UGC 10693 


17:04:53.01 


+41:51:55.76 


0.02799 


13.59 


-21.79 


2.14 


E 


UGC 10710 


17:06:52.52 


+43:07:19.96 


0.02795 


14.52 


-20.86 


1.98 


Sb 


NGC 6394 


17:30:21.42 


+59:38:23.62 


0.02842 


14.60 


-20.81 


1.82 


SBb 


NGC 6411 


17:35:32.84 


+60:48:48.26 


0.01227 


12.99 


-20.59 


2.37 


E 


NGC 6497 


17:51:17.96 


+59:28:15.15 


0.02055 


13.80 


-20.88 


2.39 


SB(r)b 


NGC 6515 


17:57:25.19 


+50:43:41.24 


0.02284 


13.72 


-21.22 


2.03 


E 


UGC 11228 


18:24:46.26 


+41:29:33.84 


0.01935 


14.04 


-20.54 


2.03 


SBO 


UGC 11262 


18:30:35.69 


+42:41:33.70 


0.01862 


15.16 


-19.33 


1.82 


Sd 


NGC 6762 


19:05:37.09 


+63:56:02.79 


0.00976 


14.08 


-19.00 


1.82 


so/a 


UGC 11649 


20:55:27.62 


-01:13:30.87 


0.01265 


14.01 


-19.64 


2.29 


SB(r)a 


UGC 11680* 


21:07:41.33 


+03:52:17.80 


0.02599 


14.56 


-20.66 


2.89 


Sb 


NGC 7025 


21:07:47.33 


+ 16:20:09.22 


0.01657 


13.17 


-21.07 


2.28 


Sa 


UGC 11694 


21:11:52.02 


+ 11:16:34.11 


0.01699 


14.09 


-20.21 


2.18 


SO 


UGC 11717 


21:18:35.41 


+19:43:07.39 


0.02116 


14.68 


-20.09 


2.29 


S 


UGC 11740 


21:26:14.35 


+09:47:52.45 


0.02151 


14.70 


-20.11 


1.90 


S? 


NGC 7194 


22:03:30.93 


+ 12:38:12.41 


0.02718 


13.87 


-21.44 


2.02 


E 


UGC 12127 


22:38:29.41 


+35:19:46.89 


0.02745 


13.88 


-21.46 


2.56 


E 


UGC 12185 


22:47:25.06 


+31:22:24.67 


0.02217 


14.35 


-20.52 


2.03 


SBab 


NGC 7549 


23:15:17.26 


+ 19:02:30.43 


0.01573 


14.14 


-19.98 


2.01 


Sb? 



(*) This object has accurate data with the V500 setup only. 

^ Data obtained from the NED. 

^ Data obtained from the NYU-SDSS catalogue. 



+ 1.56'', -0.78''. These off'sets correspond to half the spacing be- 
tween adjacent fibers, which allows to cover the holes between 
fibers in the central bundle. However, in order to correctly ad- 
dress the problems generated by the vignetting reported before, 
we adopted a diff'erent dithering scheme for CALIFA, with wider 
offsets in RA and DEC, i.e.: 0",0" ; -5.22",-4.84" ; -5.22", 
+4.84". These off'sets correspond to jumping one fiber to the 
adjacent hole. This procedure ensures that in the final dithered 
and rebinned dataset the throughput losses for any spectrum af- 
fected by vignetting are compensated through the spectra from 
at least two adjacent fibers, closer than 2", which are unaff'ected 
by vignetting. A spatial recomposition of the three pointings is 
included in the standard data reduction scheme. 

All objects will be observed using two diff'erent and com- 
plementary setups. Firstly, the new grating, purchased for 
CALIFA and named V500, is used. It has a nominal resolu- 
tion of A/AA -850 at -5000 A (FWHM~6 A), to cover the 
widest possible wavelength range, nominally from [Oll]A3121 
to [SII]/16731 in the rest frame of all objects in the survey. 
Secondly, the V1200 grating ( |Roth et al.||2()05| ) is used, with a 
nominal resolution of .1/A^ -1650 at -4500 A (FWHM-2.7 A). 
It covers the blue wavelength range, including [0II]/13727, the 
Balmer break at -4000-4400 A, H^, Hy and [OIII]^4363. The 
main purpose of the first setup is to study the stellar popula- 
tions and the properties of the ionized gas using the widest pos- 
sible baseline of spectral features and emission line species. The 
observations with the second setup will provide accurate mea- 
surements of both the stellar and ionized gas kinematics, mostly 
through the H+K absorption features and the [OII]/13727 emis- 
sion line, respectively. 

In order to avoid strong eff'ects of the described vignetting 
on the final dataset, the wavelength range covered by each grat- 
ing was selected in a complementary way. The range covered 
by the V500 grating was selected such that the reddest emis- 
sion line of interest ([SII]/16731) was free of vignetting in all 



fibers. The range covered by the VI 200 grating was selected 
such that the bluest emission line of interest ([0II]/13727) was 
free of vignetting. This selection reduces the final wavelength 
range covered with each grating, but guarantees that all fea- 
tures of interest are unaff'ected by vignetting in at least one 
of the setups. A fixed setup was selected for each grating for 
the whole survey, to guarantee the homogeneity of the dataset. 
The final nominal wavelength range covered by each grating is 
3745-7300 A (V500) and 3400-4750 A (V1200), respectively. 
However, for the VI 200 grating, the wavelength range bluer than 
3700 A is of limited use, due to the sharp drop of the transmis- 
sion of the PPAK IFU fore-optics and fibers in this wavelength 
range. 

The exposure time is fixed for all targeted objects. For the 
V500 grating a single exposure of 900s per pointing of the 
dithering pattern is taken, while for the VI 200 grating 3 expo- 
sures of 600s each are obtained per position. The exposure times 
were selected based on our previous experience with the instru- 
ment (in particular |Marmol -Queralto 2011, a feasibility study 
of the current survey), and expectations about the performance 
of the V500 grating (which was unknown at the beginning of the 
survey). 

Data presented here were obtained during 16 nights (6 dark 
and 10 grey) in June- July 2010. A total of 20 objects were ob- 
served in both setups (1 more was observed only with the V500 
grating). Table [T] lists the observed objects, including their co- 
ordinates, redshift, SDSS g-band observed and absolute magni- 
tudes and u- g color and their morphological classification (ex- 
tracted from the NASA/IPAC Extragalactic Databas^. These 
objects are described here since we will use them to illustrate 
the current status of the data acquisition and its quality. 



http://nedwww.ipac.caltech.edu/ 
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Fig. 4. Left panel: Intensity map of the differential transmission fiber-to-fiber for the V500-grating (known as fiber-flat), correspond- 
ing to the night of the 10th of June 2010. The efl'ects of vignetting are clearly visible with a significant drop of the transmission at 
the edges of the image. Right panel: Spatial distribution of the fraction of pixels free of vignetting, when adopting a conservative 
criterion of 75% of the average transmission in the central fibers. Two fibers at the bottom/right show a transmission below this limit 
in the whole wavelength range. 



5. Data reduction 

The reduction of the CALIFA data is performed using a fully au- 
tomatic pipeline developed ad-hoc. This pipeline operates with- 
out human intervention, producing both the scientifically useful 
frames and a set of quality control measurements that help to 
estimate the accuracy of the reduced data. The pipeline uses the 
routines included in the R3D package ( Sanchez2006a) and the 
E3D visualization tool ( Sanchez et al. 2004 ) , following the stan- 
dard s teps for fiber-based IFS (e.g., Sanchez|2006at Sandin et al. 
2010| ). The pipeline will be upgraded and improved during this 
project on the basis of the quality tests performed on the acquired 
data. Here we present a summary of the current implementation 
of the data reduction. 



5.1. Removal of electronic signatures and realignment of the 
frames 

The CCD used for CALIFA is a new 4kx4k E2V detector in- 
stalled in the instrument in October 2009. In contrast to the old 
2kx4k one (Roth et al. 2005), the read-out software for the new 
CCD stores each frame in four diff'erent FITS files, correspond- 
ing to each of the four amplifiers of the detector, each one with 
diff'erent bias and gain levels. As a first step, the four files are 
combined into a single frame, after subtracting the correspond- 
ing bias level, and transforming from counts to electrons on the 
basis of the corresponding gain. A master bias frame is created 
by averaging all bias frames observed during the night, in order 
to cross-check the stability of the bias level. In contrast to the old 
CCD, the new CCD does not exhibit any detectable structure in 
the bias frame. 

The next step is clipping cosmic rays and combining diff'er- 
ent exposures taken at the same position on the sky. However, 
prior to combination, possible off'sets between the projected 
spectra in the CC D due to flexure are estimated. As shown in 
Kelz et aLldSOOS'), 



the time. On the other hand, calibration frames are taken within 
a maximal range of 1.5 hours from the science frames. This en- 
sures that in most cases the flexure pattern of the calibration 
and science frames is the same and, therefore, it is possible to 
trace the spectra projected in the CCD and extract them prop- 
erly. However, in a few cases (e.g. when the targets are observed 
when setting), there could be small linear ofl'sets between the 
science and the calibration frames. To guarantee a correct han- 
dling of flexure, these ofl'sets are estimated based on the position 
of several (-50) spots in the raw data. They correspond to the 
Hg emission lines of the arc lamps on one cal ibration frame, 
and the night- sky light pollution Hg emission ( Sanchez et al. 
2007a) on the science frames (including Hg/14047, Hg/14358 
and Hg/15461). A linear shift is applied to the frame, when a 
significant off'set (>0.1 pixel) is detected among the diff'erent 
exposures, and between them and their corresponding calibra- 
tion frames (continuum and arc lamp exposures). This was the 
case in ~2% of the frames taken during the first observing runs. 
Finally, a cosmic-ray cleaning algorithm is applied if only one 
single frame is taken at a certain position. The actual cleaning 
algorithm is based on the Lapla cian Edge detection algorithm 
described in van Dokkum ( 2001| ), recently implemented in R3D. 



flexure is a major issue when working with 
PRAK. In general, the amplitude of the flexure is stronger while 
targets are setting. However, in a large survey like CALIFA, it 
will not be feasible to avoid observing setting targets 100% of 



5.2. Spectral extraction, wavelength calibration and fiber 
transmission correction 

The locations of the spectra on the CCD are determined using a 
continuum-illuminated exposure taken before the science expo- 
sures. Each spectrum is then extracted from the science frames. 
In order to reduce the efl'ects of cross-talk we do not perform 
a simple aperture extraction, which would consist of co-adding 
the flux within a certain number of pixels of the location derived 
from the continuum-illuminated exposure. R ather, we adopt a 
modified version of the Gaussian suppression ( Sanchez|2006a ), 
first used in the reduction of th e PINGS da t a ([Rosales-Ortega 
|et al.||2010 ), and described in Sanchez et al."(2011). The cross- 
talk is reduced to less than 1% when adopting this new method. 
This procedure relies on an accurate internal focus of the in- 
strument. The quality of the instrumental focus, which depends 
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Fig. 5. Flux density-to-counts ratio for the gratings used in the 
CALIFA survey, derived for the most transparent night of those 
observed so far, as compared to the most frequently used grat- 
ings for PMAS (V300 and V600). 



on the internal temperature of the structure of the instrument, is 
controlled by fitting a Gaussian to a subset of the projected con- 
tinuum illuminated spectra along the cross-dispersion axis, and 
comparing the results with the nominal value for the instrument 
(-2.5 pixels). 

The extracted flux, for each pixel in the dispersion direction, 
is stored in a row- stacked- spectrum file ( S anchez et al. 2Q04j ). 
Wavelength calibration is performed using HeHgCd lamp ex- 
posures obtained before and after each pointing. The number of 
identified emission lines used to perform the wavelength calibra- 
tion ranges between 11 (for the VI 200 grating) and 16 (for the 
V500 grating), homogeneously distributed along the wavelength 
range covered. It was found that the adopted procedure yields an 
accuracy of about ten percent of the nominal pixel scale in the 
wavelength calibration (i.e. , ~ 0.2 A for the V5 00 grating and 
~ 0.1 A for the VI 200 one). Diff'erences in the relative fiber- 
to-fiber transmission throughput are corrected by comparing the 
wavelength-calibrated row- stacked- spectra science frames with 
the corresponding frames derived from sky exposures taken dur- 
ing the twilight. These frames are also used to determine which 
pixels at the edge of the CCD are aff'ected by the previously de- 
scribed vignetting eff'ect. Those pixels for which the transmis- 
sion drops to < 75% are masked. 

5.3. Sky subtraction 

PPAK is equipped with 36 fibers to sample the sky, distributed 
around the science fiber-bundle, in six small bundles of six fibers 
each, at a distance of -75'' from the center of the FoV (see 
Kelz et al. 2006). By construction, the objects selected for the 
CALIFA final sample cover a substantial fraction of the FoV of 
the central PPAK bundle, with most of the sky fibers free from 
emission by the corresponding target. The procedure adopted to 
derive the night-sky spectrum is to combine the spectra corre- 
sponding to these fibers, performing a 2cr clipping rejection to 
remove any possible contamination from low surface brightness 
regions of the galaxy and/or projected companions. Once ob- 
tained, the sky spectrum is subtracted from all spectra of the 
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Fig. 6. Relative diff'erences, in percent, between the response 
curves found on diff'erent nights for the V500 grating dataset 
(top panel) and the VI 200 grating (bottom panel), as a func- 
tion of wavelength, normalized to unity at 4600 A and 4150 A, 
respectively. The solid hashed region (and red solid line) shows 
the Icr error, while the dashed hashed region (and black dashed 
line) shows the 2cr error. 



corresponding frame. The accuracy of the subtraction is con- 
trolled by comparing the equivalent width of the most prominent 
night- sky emission lines ([Sanchez et al.|2007a), before and after 
the subtraction for each individual spectrum within the frame. 
In addition, each derived night-sky spectrum is used to derive 
the night- sky brightness at the location and time when the tar- 
get is observed, by convolving the spectrum with the transmis- 
sion curves of the Johnson V-band (5-band) filter for the V500 
(VI 200) setups. This value is used to control the actual condi- 
tions when the data are acquired. 

5.4. Flux Calibration 

Flux calibration is performed by comparing the extracted spec- 
tra of spectrophotometric standard stars from the Oke Catalogue 
(Oke 1990), observed during each night, with their correspond- 
ing flux-calibrated spectra, publicly available on the webpage 
of the observator}]^ For the currently observed nights, the stars 

^ http ://w w w. caha. es/pedraz/S S S/s s s . html 
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Fig. 7. Comparison of the central 5'' spectrum extracted from the V500 (black, thick solid line) and V1200 (red, dashed line) 
grating datacubes for three typical objects (from left to right): NGC 6411, the brightest object observed so far; NGC 6497, a spiral 
galaxy with an average brightness, and UGCl 1262, the faintest target observed so far. The green thin solid line shows the difference 
between both spectra. The orange hashed region indicate the ±10% median intensity range of both spectra around the zero level, 
indicated with a grey continuous line. 



were: BD+25d4655, BD+28d4211 and BD+33d2642 (although 
any suitable star from the list will be used in the course of the 
survey). When feasible, two different calibration stars are ob- 
served in the same night. Stars are not observed using the dither- 
ing pattern. A finite aperture of r <W was selected after several 
experiments, trying to maximize the covered flux and the signal- 
to-noise. This finite aperture, and the incomplete coverage of the 
FoV by the PPAK fiber bundle, produce a systematic offset of 
15% in the flux calibration, estimated on the basis of simulated 
extractions of stellar images with a typical seeing of -T'. This 
offset is well known, and it has already been reported by other 
authors (e.g. PINGS, Rosales-Ortega et al. 2010). This system- 
atic offset has been corrected prior to any further analysis. 

R3D includes a procedure that finally provides the transfor- 
mation function from observed counts to intensity, taking into 
account also the airmass and extinction of the observations of 
both the spectrophotometric standard stars and the science tar- 
gets. To apply this procedure we adopted the extinction mea- 
sured by the Calar Alto Vistual Extinction monitor (CAVEX) 
at the moment of the obs ervations, and the av erage extinction 
curve for the observatory ( Sanchez et al.|2007a ). The procedure 
ensures a good relative flux calibration from the blue to the red 
part of the spectra, if the weather conditions throughout the night 
are stable. However, an absolute offset between the derived and 
real fluxes is expected if the weather conditions varied during 
the observations. If more than one standard star is observed, the 
pipeline performs a comparison between the transformations de- 
rived using each of them, in order to estimate the photometric 
stability of the night considered. 

Figure [5] shows the derived transformation function from 
counts to flux density for both the VI 200 and V500 gratings 
(corrected for the offset in resolution), together with those for 
the most frequently used gratings for PMAS (the V300 and V600 
ones). These transformation functions were derived by selecting 
the curves for the most transparent nights included in the cur- 
rent study, together with the ones derived in a previous study 
(|Marmol-Queralto|[20TT| for the V300 and V600 gratings). It 



should be noted that there was a major update in the instru- 
ment, with an exchange of the CCD, between the observations 
taken with the V300 grating and the other observations. The off- 
set between the efficiency found for this grating and the remain- 
ing gratings can easily be explained by the improvement in effi- 



ciency of the new CCD. Otherwise all efficiency curves are quite 
similar. This basic conclusion would not change if instead of us- 
ing the best curve, we had selected the mean or median transfor- 
mati on function. The se results are in agreement those reported 
by R oth et al. (2005), where it was stated that all the V-gratings 
show basically the same efficiency (after correcting for the reso- 
lution effects). 

Figure [6] shows the dispersion between the different response 
curves found for different calibration stars and/or nights as a 
function of the wavelength for the V500 and VI 200 gratings, 
normalized by the offset in the zeropoints at 4600 A and 4150 A, 
respectively. The differences between one response curve and 
another as a function of the wavelength reflect the intrinsic dis- 
persion of the flux calibration. The flux calibration obtained by 
applying this response curve is just a relative one. An absolute 
calibration is obtained by rescaling using a factor derived after 
the comparison with broad-band imaging, as described in forth- 
coming sections. On average, the relative flux calibration shows 
a dispersion of -2-3% (-0.025 mag), for wavelengths redder 
than of 3850 A. At shorter wavelengths, the error in the accu- 
racy of the relative flux calibration becomes worse, being about 
-8% (-0.08 mag) at -3700-3750 A. These values are similar to 
the ones found in classical slit spectroscopy. 

5.5. Spatial re-arranging and image reconstruction 

Once the frames corresponding to each particular pointing are 
reduced, the science spectra corresponding to the three dithered 
exposures are combined in a single frame of 993 spectra. To 
do so the images are rescaled to a common intensity and re- 
sponse function by comparing the integrated spectra within an 
aperture of 30'Vdiameter. This procedure takes into account pos- 
sible variations in the atmospheric transmission during the sci- 
ence exposures. In addition, the procedure produces a final po- 
sition table, where the individual position tables (corresponding 
to the PPAK central bundle) are combined using the offsets pro- 
vided for the dither. Finally, the data are spatially resampled to 
a datacube with a regular grid. This is done by adopting a flux- 
conserving variation of Shepard's interpolation method ( Shepard] 
1968 ). This method is a very simple and robust tessellation-free 
interpolation of scattered data, where the intensity at each inter- 
polated point is the sum of a weighted average of the intensities 
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Fig. 8. Distribution of the rms of the difference between the nominal and measured wavelengths of the night-sky emission lines, 
derived from the statistical analysis of all the considered emission lines within the same science frame for the V500 (left panel) and 
VI 200 (right panel) grating datasets. The mean shift is around zero. 



corresponding to n adjacent scattered points (the original spec- 
tra) within a certain boundary distance (rum): 



k-n 



k^l 

where F{i, j) is the reconstructed intensity in the pixel (/, j) of 
the final datacube at a certain wavelength, . is the weight at 
this pixel of the adjacent spectrum k, and fk is the intensity of 
this adjacent spectrum, at the considered wavelength. 
The weights are derived by a Gaussian function: 

w = Nexp[-0.5(r/crf] 

where is a normalization parameter, r is the distance between 
the pixel (/, j) and the spectrum k, and cr is the parameter that 
defines the width of the Gaussian function. The normalization 
parameter is derived for each interpolated pixel, being defined 
as the inverse of the sum of the diflferent weights of the spectra 
contributing to this pixel: 



N(iJ) = 



1 



n...n < nil 



This normalization guarantees that the integrated flux is pre- 
served. The adopted image reconstruction assumes a boundary 
limit of riim =5'', cr =V\ and a final pixel scale of TVpixel for 
the resulting datacube. Prior to this interpolation, the flux corre- 
sponding to those spectral pixels masked due to the vignetting 
are replaced by the average of the fluxes at the two nearest fibers 
not aff'ected by this eff'ect. Finally, the flux intensity is cor- 
rected by the difl'erence in aperture between the original fibers 
(~2.7'7diameter) and the final pixels. 

5.6. Differential Atmospheric Refraction 



The diff'erential atmospheric refraction (DAR, |Filippenko| 1 982| ) 
is corrected once the data are spatially resampled to a datacube 
with a regular grid. In the case of IFS data, the DAR can be 
corrected empirically, after the observations, without requiring 
to know the original orientation of the instrument and without 



the need of a compensa tor ( [Emsellem et al.|1996t [Arribas et al. 
|1999t|Roth et al.||2004| ). To do so, the reconstructed datacubes 
are thought of as a set of narrow-band images with a band- width 
equal to the spectral resolution. These images can be recentered 
using the theoretical off'set determined by the DAR formulae 
( Filippenko„ 1982), or, as in 2D imaging, tracing the intensity 
peak of a reference object in the field-of-view (or a DAR ref- 
erence observation) along the spectral range, and recentering it. 
Note that this latter approach is basically unfeasible in slit spec- 
troscopy, this being one of the fundamental diff'erences between 
the two methods. 

We adopted this latter empirical correction for the CALIFA 
pipeline, using the corresponding tools included in R3D. For do- 
ing so, the centroid of the observed object (i.e., the galaxy) is 
derived at each wavelength, by determining its barycenter in an 
image slice extracted from the datacube, within a range of 20A 
around the considered wavelength. Then, the shifts along the 
wavelength are determined by comparing the corresponding co- 
ordinates to a common reference. A polynomial function of or- 
der 3 is fitted to each shift (X and Y) along the wavelength to in- 
crease the accuracy of the determined off'set (e.g. Wisot zki et al. 
p003 ). Then, the full datacube is shifted by resampling and shift- 
ing each image slice at each wavelength. In general, the empir- 
ical correction produces similar results to the theoretical one. 
However, the theoretical correction is not well defined when 
different pointings of the same object are observed during 
different nights, due to weather restrictions (-15% of the ob- 
served targets). These observations may span over different 
dark-time periods. The use of the empirical correction guar- 
antees the homogeneity of the treatment of DAR correction 
for all the datasets. 



5. 7. Absolute flux re-calibration 

In order to get the best possible absolute flux calibration, we re- 
calibrate our data based on SDSS photometry, available for all 
targets by construction of the sample. Of the five SDSS filters 
(ugriz), two namely, the g (A^ff =4770 A) and r (/leff=6231 A) fil- 
ters, are covered almost completely by the V500 grating data, 
while u (Aqs =3594 A) is only partially covered by both the 
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Fig. 9. Left panel. The distribution of the FWHM of the instrumental dispersion derived from the analysis of the emission lines 
within the arc-lamp frames at different wavelengths and different positions within the FoV, for the VI 200 data. Right panel: The 
spatial distribution of the average FWHM of the instrumental dispersion, derived from the same arc lines. 



V500 and VI 200 grating data. Therefore, we use the V500 grat- 
ing data and the g and r band SDSS images to perform a primary 
flux recalibration. 

To perform the primary recalibration, we measure the counts 
of each galaxy in the SDSS image inside a 30'' diameter aper- 
ture. These counts are converted to flux density following the 
counts-to-magnitude prescription in the SDSS documentatiorj^ 
The accuracy of this photometry was cross-checked by obtain- 
ing the magnitudes of any detected star in each field, using the 
same procedure, and comparing them with the values listed in 
the SDSS DR6 photometric catalogue. We obtained less than 
0.05 magnitude dispersion for the stars with g < 17.5 mag. Once 
we ensure the accuracy of this photometry, we extract the spec- 
trophotometry from the reduced datacubes corresponding to the 
V500 grating setup, coadding the flux of individual spectra in- 
side a 30'' diameter aperture and convolving this spectrum with 
the SDSS g a nd r filter passbands (ADPS database, |Moro &| 
MunaripOOO I. Using these two data pairs, a scaling solution is 
found, by adopting the average of the flux ratio in both bands. 

Once the V500 datacube has been recalibrated, a new 
scaling solution for the VI 200 data is derived by comparing the 
5" aperture extracted spectra around the central position of each 
object from these cubes with those extracted from the original 
reduced VI 200 data. In this case, a low-order polynomical 
function is fitted to the ratio between both spectra within the 
common wavelength range (~3745-4770 A). The result of this 
polynomial fitting is then applied to the VI 200 data, to match 
them to the V500 ones. Figure [7] shows the central aperture 
extracted spectra for both setups together with the difference be- 
tween them (5 "/diameter), for three typical targets. It illustrates 
the agreement between both datasets, once the recalibration 
procedure has been applied. The VI 200 spectra have been 
convolved with a Gaussian function to compensate for the 
difference in resolution, for the purpose of this figure only. The 
typical difference between the spectra is -10% of the average. 
The strongest differences are located at the wavelengths of the 
more significant spectral features (absorption and/or emission 
lines). 



^ http ://w w w. sdss . org/dr6/algorithms/fluxcal.html 



We conclude that the current version of the pipeline, which op- 
erates automatically, is able to reduce the data, producing the re- 
quired data cubes to perform early experiments and estimations 
of their quality. The pipeline will be continuously upgraded, pro- 
ducing different versions of the data, on the basis of the results 
of the different foreseen quality controls, prior to releasing the 
data to the community. In the next section we describe the early 
tests implemented to estimate the quality of the data. In some 
cases these tests have already induced slight modifications in the 
pipeline, as we will describe below in detail. 

6. Quality of the data 

In parallel to the reduction of the data, the pipeline performs a set 
of automatic tests, which are stored during the reduction process, 
creating a set of tables and figures. These allow the CALIFA 
team to check the quality of the data and to identify possible 
problems in the data reduction. In this section we present the 
basic results of the currently implemented quality controls on 
the data acquired so far. We expect the quality control to become 
more comprehensive with time. 

6.1. Accuracy of the wavelength calibration 

In general, the wavelength solution is found with an accuracy of 
the order of 10-15% of the nominal pixel scale, i.e., the rms of 
the fit with the low order polynomial function is ~0. 2-0.3 A for 
the V500 grating and -0.1-0.2 A for the V1200 grating. 

In order to obtain an independent estimate of the accuracy 
of the wavelength calibration of the science frames, we compare 
the nominal and recovered wavelengths of the most prominent 
night-sky emission lines in each individual spectrum prior to the 
sky subtraction. The central wavelength of these lines was deter- 
mined by fitting a Gaussian function to each line. This provides 
us with 331 estimations of the relative offsets between both val- 
ues for each considered night- sky line. For the V500 grating we 
use the HgU5461, [OI]^5577, NaD^5893 and [OI]^6300 lines, 
while for the V1200 grating we use the HgI/14046 and HgI/14358 
ones. Only measurements derived using emission lines with a 
signal-to-noise higher than 10 are considered. Finally, a few hun- 
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Fig. 11. Comparison of the g-band magnitude derived from the 
integrated spectrum within an aperture of 3C diameter extracted 
from the V500 datacubes and the corresponding magnitudes de- 
rived from the SDSS images, before (blue solid circles) and after 
(red solid squares) applying the flux recalibration procedure de- 
scribed in the text. 



dred measurements of the diff'erences between the nominal and 
recovered wavelengths are retained for each frame correspond- 
ing to both gratings. A simple statistical analysis allows us to 
estimate the final accuracy of the wavelength calibration, deriv- 
ing both the median off'set (if any) and the rms of each dataset. 

No mean shift was found between the nominal and mea- 
sured wavelength for the diff'erent analyzed sky lines. Figure [S] 
shows the distribution of the rms of the diff'erence between the 
nominal and measured wavelengths of the night-sky emission 
lines, found for each science frame, for both setups (left and 
central panel). In both cases the values range between 0.01 A 
and 0.35 A. The resulting median value is very similar for both 
distributions, being around ~0.1 A, which corresponds to ~8 km 
s"^ at the wavelength of [OII]/13727 and ~3 km s"\ at the wave- 
length of Ha. 



6.2. Empirical spectral resolution 

The spectral resolution of a dataset is limited at best to the nom- 
inal instrumental resolution. However, diff'erent eff'ects, such as 
problems with, or degradation in, the internal focus of the in- 
strument, errors in the tracing/extraction process or in the wave- 
length calibration may aff'ect the final spectral resolution. 

In order to estimate the real spectral resolution for each sci- 
ence frame, we fit the strongest night-sky emission lines (see 
Section 6.1 ) in each individual science spectrum and before sub- 
tracting thejiight-sk}^^ with a Gaussian function using 
FIT3D ( Sanchez et al.|2006b ). Night-sky lines are unresolved at 
our resolution, thus their widths provide good estimates of the 
final resolution at their respective wavelengths. The median and 
standard deviation of the FWHMs are stored for each science 
frame. 

The spectral resolution found for the VI 200 grating is very 
similar to the nominal value (~2.7 A), and is consistent with the 
values reported by Roth et al. (2005). For the V500 grating on the 
other hand, there is no published information about its nominal 
spectral resolution, which was estimated to be ~6 A (Roth, pri- 
vate communications). The empirical resolution estimated from 



night-sky lines is ~6.5 A. These instrumental spectral resolu- 
tions correspond to velocity resolutions of cr ~85 km s"^ and 
o- -150 km s-i for the V1200 and V500 datasets, fulfilling the 
requirements of the survey, for both setups. 

The same experiment can be repeated using the arc-lamp 
frames obtained during the night instead of the science frames. 
This procedure has the advantage that all selected lines have high 
signal-to-noise, and are evenly distributed over the full spectral 
range. However, the derived instrumental dispersion cannot be 
directly compared with the science data, since the arc-lamp ex- 
posures are an order of magnitude shorter in time and, there- 
fore, they are not aff'ected by the degradation in resolution due 
to the drift of the spectra on the CCD in long exposures (due 
to the change of the flexure pattern). For this experiment, a dat- 
acube for each reduced arc lamp frame was created following 
the prescriptions for science frames. Then, for each spectrum of 
this datacube, we fit each emission line with a single Gaussian 
function, deriving the central wavelength, intensity and width 
(in FWHMs). These latter values are considered as an estimate 
of the instrumental dispersion. As expected the derived values 
are smaller (and more precise) than thos reported above, be- 
ing 2.30+0.11 A for the V1200 (ranging between 2.2-2.4 A) 
and 5.65+0.21 A for the V500 (ranging between 5.3-6.1 A). 
However, the final resolution achieved in the data is more simi- 
lar to that estimated from the night- sky lines, than that estimated 
from the ARC-lamps, which are observed in shorter integration 
times. 

Although not clearly appreciated in the measurements per- 
formed using the night- sky lines, there is a clear spatial and 
wavelength dependence of the instrumental resolution. Figure |9] 
shows both distributions, for the VI 200 data (a similar result is 
found for the V500 data). The left panel shows the distribution 
with wavelength, which shows the larger diff'erence, i.e. ~13%. 
At each wavelength we plot values found at diff'erent locations 
within the FoV, shown in the right-panel. The amplitude of the 
spatial pattern is ~5%. The pattern seen in the spatial distribu- 
tion, which has a maximum in an annular ring at -15'' from the 
center, and an underlying gradient, is a consequence of the cor- 
respondence between the spatial distribution of the PPAK fibers 
within the FoV and their positions along the entrance pseudo- 
slit of the spectrograph. The fibers located in the described ring 
correspond to those at the edges of the pseudo-slit. This implies 
they also lie at the edges of the CCD, where the resolution is 
worse. The underlying gradient is possibly an eff'ect of a slight 
tilt of the focal plane of the CCD with respect to the focal plane 
of the spectrograph camera. 

The correction of these instrumental signatures were not 
originally considered in the version of the pipeline described in 
the Section |5] These data allowed us to understand the changes 
in resolution along the wavelength and across the field. We 
found no clear dependencies of the instrumental resolution with 
the positioning of the telescope, at least for the range of air- 
masses at which we observe the targets (< 1.4). There is a clear 
dependency with the temperature of the instrument, which af- 
fects its internal focus. This is corrected by the focus procedure 
adopted every night, and a careful monitoring of this tempera- 
ture (changes larger than -0.5° C are rare, along a single night). 
However, both eff'ects produce mostly a change in the global 
value, rather than a change in the described pattern, which seems 
to be rather stable. 

Based on these results, we implemented a procedure in the 
pipeline to correct for the diff'erences in the spectral resolution 
across the FoV and along the spectral range by fixing it (in terms 
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Fig. 10. Distribution of the relative residual flux after subtracting the night-sky spectrum for the strongest emission lines on the 
corresponding wavelength range covered by each setup: [0I]/15577 for the V500 grating (left panel) and HgI/14358 for the V1200 
grating (right panel). 
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Fig. 12. Top panel: Comparison of the published flux-calibrated 
spectrum of a calibration star (black, thick solid line), with 
the flux re-calibrated spectrum extracted from the V500 data 
(red, dashed line). The green line shows the residuals, once cor- 
rected by the difl'erent resolution of each spectra. Bottom panel: 
Distribution of the residuals in intensity, pixel-to-pixel, for all 
the calibration stars observed along the run. The standard de- 
viation is lower than ~5% for the considered wavelength range 
(3800-7000A). 
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Fig. 13. Color image of the signal-to-noise map derived for the 
V500 data corresponding to the continuum emission of the NGC 
5947 galaxy at the wavelength range between 4480 and 4520 A, 
based on the rough estimation described in the text. The contours 
shows the intensity of the V-band image synthesized from the 
datacube, with the 1st contour at 1.5 10"^^ erg s"^ cm"^ arcsec"^ 
(-23.5 mag/arcsec^) and consecutive ones with a step of 3 10"^^ 
erg s"^ cm"^ arcsec"^. 



of FWHM) to the worst value found for each dataset (based on 
the measurements of the FWHM in the ARC-lamp frames). For 
doing so, the described derivation was repeated for each arc- 
lamp exposure, associated with each science frame. Then a low 
order polynomial function is fitted to the derived values of the 
resolution, per spaxel and per pixel. Finally, the diff'erential res- 
olution is derived by obtaining the quadratic diff'erence between 
the estimated value at each pixel and the globally worst value. 



13 



S.R Sanchez et al.: CALIFA, the Calar Alto Legacy Integral Field Area survey: 



The science spectra are then convolved pixel-to-pixel by this dif- 
ferential resolution, normalizing it along the spectral range and 
across the FoV. The final resolution achieved is slightly worse 
than the one described above, due to flexure-related shifts in 
wavelength: ~2.7A for the V1200 and ~6.5A for the V500, as 
derived from the measurements of the night-sky emission lines 
width. However, with the adopted procedure the resolution is 
normalized along the spectral range and across the field. 

Any further analysis described in this article is based on 
these spectra, with a homogeneous spectral resolution (in terms 
ofFWHM). 

6.3. Accuracy of the sky subtraction 

The determination of the night- sky spectrum and its subtraction 
from the data is an issue in many IFUs, mostly due to the small 
FoV and the resulting lack of a pure night-sky spectrum. By de- 
sign the CALIFA sample galaxies have an optical size that fits 
within the FoV of the central bundle of the PPAK IFU, while 
most of the 36 sky fibers are free of any contamination from 
galaxy light. The procedure adopted to subtract the sky is there- 
fore rather simple, as explained above (Section [53 ). 

In order to estimate the accuracy of this procedure, we 
measured the flux corresponding to the strongest night- sky 
lines before and after subtracting the estimated night- sky spec- 
trum. These lines are [OI]/15577 A for the V500 grating and 
Hg/14358 A for the VI 200 grating. Their typical intensity 
ranges between 1-6 10"^^ erg s"^ cm"^, measured from adding 
the flux within a range of 15 pixels around their nominal wave- 
lengths, and subtracting the flux in two adjacent continua of the 
same width, at shorter and longer wavelengths. After subtracting 
the night- sky spectra the typical residual value ranges between 
±0.5 10"^^ erg s"^ cm"^ for 99% of the spectra, i.e., the sky is 
subtracted with an accuracy of 1-5%, with a typical residual of 
less than 2%. Figure 10 shows the distribution of intensities ob- 
tained for each considered line once the night-sky spectra have 
been subtracted. 

The reduction pipeline stores the actual night- sky spectrum 
subtracted from each science frame, to allow inspection in more 
detail if needed. In particular, the night-sky surface brightness 
is measured by the pipeline and stored as a parameter to eval- 
uate the quality of the data. To do so, each night- sky spectrum 
is convolved with the 5-band (V1200) and V-band (V500) fil- 
ter response curves obtained from the ADPS database (Moro 
& Munari 2000), and then the corresponding surface brightness 
was estimated, applying the zero points listed in Fukugita et al. 
(1995). For the currently observed data the median values of 
the derived night-sky surface brightness are within -0.5 mag of 
the typical values for a dark night at Calar Alto ( |Sanchez et al.| 
|2Q07aj ). 



6.4. Accuracy of the flux calibration 

The selected observational and reduction strategy uses the spec- 
trophotometric calibration stars observed during the nights to 
perform an initial flux calibration. The main aim of this flux cal- 
ibration is to correct for the relative transmission of the system 
from blue to red, and provide a preliminary absolute zero point. 
As indicated in Sec. [5] the accuracy of the relative transmission 
is better than a few percent in most of the covered wavelength 
range, which is sufficient for the science goals of this survey. On 
the other hand, the accuracy of the absolute spectrophotomet- 
ric calibration was -24%, on average, for the clear/photometric 



nights, and it can be worse for data taken under non-photometric 
conditions. Therefore a flux recalibration procedure is imple- 
mented in the pipel ine, anchoring our photometry to that of 
the SDSS data (Sec. |5.7| ). It is expected that the final absolute 
spectrophotometric accuracy improves to ~8% photometric er- 
ror, on the basis of our previ ous experience with the PINGS data 
( Rosales-Qrtega et al.|2QTQ| ). 

We cross-check the improvement in the spectrophotometric 
calibration by extracting from the datacubes the integrated spec- 
tra within an aperture of 30'' diameter around the peak intensity 
of the object, before and after the flux recalibration. The cor- 
responding g and r-band magnitudes were derived from these 
spectra, following the procedures described in Sec.|5] The mag- 
nitudes derived from the spectroscopy were then compared with 
the corresponding ones obtained from the SDSS images, adopt- 
ing a similar aperture, and following the same procedure as used 
to recalibrate the data. 



Figure [TT] shows the distribution of the g-band magnitudes 
derived from the datacubes and the SDSS images before (blue 
solid circles) and after (red solid squares) the recalibration pro- 
cedure. As expected, there is a significant improvement in the 
photometric agreement. Prior to the recalibration, both photo- 
metric datasets match within -20%. This dispersion is similar to 
that found in the zero points of the flux calibration derived night- 
by-night for the V500 data. Once the recalibration was applied, 
both datasets match each other within -0.06 mag, i.e., -7% (Fig. 
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red squares). Similar results are found for the r-band. 
To reinforce the validity of our flux re-calibration proce- 
dure, we repeated it for the flux-calibration standard stars ob- 
served during the run. The frames corresponding to the calibra- 
tion stars were reduced using the same procedure as used for the 
science objects. Instead of using the transmission curve derived 
for the corresponding night, we adopted the master curve shown 
in Fig [5] and the extinction information provided by the extinc- 
tion monitor. With this procedure we try to simulate as closely 
as possible the reduction conditions of the science frames. Next, 
to apply the same re-calibration procedure outlined before, the 
g and r-band magnitudes were synthesized from the published 
flux-calibrated spectra. The top panel of Figure \T7\ shows a typ- 
ical example of how the flux-recalibrated spectrum correspond- 
ing to a calibration star agrees with the published flux-calibrated 
one. The bottom panel shows the distribution of the relative dif- 
ferences in intensity, pixel-to-pixel, between the published and 
re-calibrated spectra, for all the calibration stars observed along 
the run. The standard deviation is below -5% for the considered 
wavelength range (3 800-7000 A) 

6.5. Signal-to-noise and depth of the data 

The pipeline performs a rough estimation of the S/N in each in- 
dividual spectrum within the final reduced datacube. The median 
and the standard deviation of the intensity values are computed 
over a wavelength range free from strong spectral features. The 
wavelength range selected to perform this analysis was 4480- 
4520 A for both gratings, in order to simplify the comparison 
between the derived values. Assuming that the scatter is en- 
tirely due to noise, which is a rather conservative assumption, the 
signal-to-noise per spatial pixel is estimated as the ratio between 
the standard deviation and the median flux. Figure \T3\ illustrates 
the result of this procedure for the V500 datacube of NGC 5947. 
A typical patchy structure results for most of the objects, due 
to the intrinsic differences in the signal-to-noise map in dithered 
exposures. 
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Fig. 14. Three-color images from the SDSS imaging survey (first panel), vs. those obtained from the V500 CALIFA data (second 
panel), for the same objects shown in Figure|7j NGC 641 1 (left), NGC 6497 (middle) and UGC 1 1262 (right). For the CALIFA data, 
u, g and r-band images were synthesized from the corresponding datacubes. The third panel shows a comparison between the az- 
imuthal averaged surface brightness profiles at the g-band, derived from synthesized images created from the V500 datacubes (blue 
circles) and the SDSS ones (red squares). The bottom panel shows the synthesized g-band image, color-scaled, together with 
two logarithm-scaled counter plots: one corresponding to the SDSS g-band image, in grey color, and the other corresponding 
to the synthesized image, in black 



Once the signal-to-noise map was obtained, we determined 
those pixels for which S/N ranges between 3 and 4. The aver- 
age flux corresponding to these spaxels (-100-200 spaxels) is 
considered a rough estimation of the 3cr detection limit of the 
considered datacube. This flux is transformed to the correspond- 
ing AB magnitude using the standard equations, and stored for 
further quality control. The depth of the datacubes depends on 
several factors, for each considered setup. It is mostly related to 



the night-sky brightness and the transparency and photometric 
stability of the night. Other factors, like the efl'ects of the atmo- 
spheric seeing, have a much reduced efl'ect due to the large size 
of the PPAK fibers (-2.7'' diameter). 

The datacubes reached a Scr signal-to-noise ratio for an av- 
erage surface brightness of -23.0 mag/arcsec^ for the objects 
observed under good weather conditions with the V500 grating, 
and -22.7 mag/arcsec^ for those observed with the VI 200 grat- 
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Fig. 15. Comparison of the central 5'' spectrum extracted from the V500 (thick back solid-line) grating datacubes with the corre- 
sponding spectrum obtained by the SDSS (thick red dashed-line), and the difference between both spectra (thin green solid-line). 
There are the only three galaxies in the presented sample with SDSS spectra: NGC 6394, UGC 09892 and UGC 1 1649 (from left to 
right) 



Table 2. Comparison of the PPAK central spectrum with the SDSS single-fiber spectrum. 



NAME PPAK central spectrum SDSS spectrum 





logio (Age) Gyr 
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cTy km s ^ 


logio (Age) Gyr 
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Ay 


cTy km s ^ 


NGC 6394 


1.1 


0.016 


0.7 


139 


1.3 


0.016 


0.5 


141 


UGC 09892 


0.7 


0.019 


0.4 


116 


0.8 


0.021 


0.3 


144 


UGC 11649 


1.3 


0.024 


0.2 


228 


1.0 


0.024 


0.2 


215 




Hof/H/? 


[OIIIl/H/? 


[NII]/HQf 


[SIIl^l^ 


Hof/H/? 


[OIII]/I^g 


[NII]/HQf 




NGC 6394 


6.9+ 2.0 


8.2+ 2.2 


1.1+0.1 


1.2+0.8 


6.6+ 1.7 


10.3+ 2.4 


1.3+0.1 


1.0+0.1 


UGC 09892 


8.0+ 3.5 


0.7+ 0.6 


0.5+0.1 


0.7+0.1 






0.5+0.1 


1.5+0.7 


UGC 11649 






1.1 + 1.1 


1.3+0.2 


3.8+4.1 


2.2+ 2.8 


1.1+0.5 


1.5+0.6 



ing. This diff'erence is expected since the diff'erence in resolu- 
tion between both setups is not completely compensated by the 
larger integration time for the latter grating. The area sampled 
with a S/N above 3cr is -0.7 arcmin^, for the V500 grating. This 



is clearly identified in Fig. 13 where the contours indicate the 
intensity level in the V-band, extracted from the datacube. The 
first contour, at ~3(T corresponds to -23.5 mag/arcsec^ in this 
particular object. Both the limiting magnitude and the area sam- 
pled over the 3cr limit agree with our expectations. The depth 
and area of the FoV sampled above the 3cr limit are very similar 
for all the observed targets: the 3cr surface brightness depth has 
a range of ±0.2 mag of the average values of each target and the 
area of the FoV above the 3cr limiting surface-brightness ranges 
between a 50% and a 90%. This is because, due to the diame- 
ter selection of the sample, all the targets have similar surface 
brightness at a similar projected distance independently of their 
real physical size or light distribution. In addition, as all the tar- 
gets are observed in dark time, there is no expected variation of 
the S/N or depth with night-sky brightness. 

Obviously, this is only true for the objects observed on clear 
weather conditions. Under bad weather conditions the S/N is de- 
graded. There are two cases of bad weather conditions consid- 
ered in the design of the survey: (1) completely lost nights (i.e., 
clouds, snow, rain...), and (2) nights with partial observations, 
i.e., nights with data taken under not-so-good weather condi- 
tions. For the first kind of nights, we have accounted an overhead 
of -30% , based on published e stimations of the fraction of use- 
ful time ( Sanchez et al.|2007a ), and they were considered in the 
total budget of nights. For the second case, our actual estimation 
is that only -10% of the targets are observed under sub-optimal 
conditions. Of those, -60% corresponds to the VI 200 setup, due 
to the larger fraction of time devoted to these observations. In 
summary, it is expected that we will obtain good quality data 



(i.e., with the described S/N and depth) for more than 500 galax- 
ies, once the survey is completed. 



6.6. Comparison with SDSS Data 

As indicated above, the CALIFA mother sample has been se- 
lected from the SDSS imaging survey. This selection provides us 
with a large ancillary dataset, including imaging in five bands for 
the complete sample and spectroscopic information for a consid- 
erable subsample of the objects (-2/3). A simple quality test of 
the obtained data is thus to compare them with the SDSS archive 
data. A collection of archive auxiliary data and an analysis of the 
properties of the mother sample will be presented in a future pa- 
per. 

A first sanity check that we perform on all reduced data is to 
visually inspect how they look compared to the SDSS images. 
For each datacube we synthesize the corresponding u (for both 
the V500 and VI 200 dataset), g, and r-band (for the V500 data) 
images, by convolving the cubes with the corresponding trans- 
mission curve, following the same procedure described previ- 
ously for the photometric recalibration. These images are com- 
pared visually with the corresponding images in the SDSS imag- 
ing survey. Some simple problems with the data acquisition (e.g., 
off-centering, error in the scaling between pointings, error in 
the dithering pattern...), show up clearly in this visual inspec- 
tion. Figure 14 illustrates the procedure, showing, for the same 
three objects shown in FigurejT] a comparison between the three- 
color images derived using both the SDSS and the CALIFA data. 
There is a clear morphological agreement between both image 
sets. Most of the structures seen in the SDSS images are clearly 
identified in the CALIFA data, in particular for the two brighter 
objects. 
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Fig. 16. Results of the fitting procedure applied to each single spectrum contained in the dataset, described in the text, for the 
six galaxies shown in Figures [7] and 15 Each panel shows the integrated spectrum of the corresponding galaxy (solid-black line) 
derived by coadding the spectra within a 3C diameter aperture centered on the peak emission in the V-band. The green line shows 
the best-fit multi-SSP stellar population, while the red line shows the best model derived for the emission of ionized gas. The blue 
line is the residual from the fit. 



Despite this qualitative agreement, there are more quantita- 
tive comparisons to illustrate the agreement with previously pub- 
lished broad-band images across the FoV (e.g., 'Sa nchez et al.| 
201 1 |). A simple one is to compare the surface brightness pro- 
files. We extracted the surface brightness profiles for the consid- 
ered g-band images, both synthesized from the V500 datacubes 
and obtained from the SDSS survey. The profiles were obtained 
by averaging the intensity within consecutive elliptical rings of 
3'' width, using the ellipticity and position angle listed in the 
SDSS-NYU catalogue for each object, and centred in the peak 
intensity of each image. The derived profiles are plotted in Fig. 
14 in the third panel. For each object the surface brightness pro- 
file derived from the g-band images derived from the datacubes 
is shown (blue solid circles), together with the same profile de- 
rived from the SDSS data (red solid squares). The error bars in- 
dicate the standard deviation around the average intensity. There 
is good agreement between both profiles at all the radii. In ad- 
dition to the surface brightness profiles, we include in the 
figure a comparison between the logarithm-scaled counter- 
plots derived from the SDSS g-band images and the corre- 
sponding ones synthetized from the CALIFA datacubes. The 
largest diff'erences are seen in the inner ~3'' radius, where the 
datacubes have a flatter distribution than found using higher res- 
olution images. This is expected since in some cases the dat- 
acubes have an incomplete coverage of the galaxy, due to the 
hexagonal pattern of the IFU. Within these two radii, the stan- 
dard deviation of the difl'erences between the surface brightness 
is -0.13 mag. This comparison illustrates the accuracy of the 
flux-calibration across the FoV. 

We also perform a cross-check using the spectra. For each 
object with an SDSS spectrum, a combined spectrum within a 
5'' diameter aperture is extracted from the V500 grating dat- 



acubes, centered on the intensity peak of the galaxy in the V- 
band. This aperture-extracted spectrum is selected to match the 
aperture of the SDSS fibers (~ 3''), taking into account possi- 
ble errors in the centroiding accuracy of these fibers and seeing 
/resolution eff'ects. Slight diff'erences due to seeing and resolu- 
tion are expected, as illustrated in Fig. [T4j bottom panel. In any 
case, no significant diff'erences are found when slightly diff'er- 
ent apertures (3 -6'') are selected, apart from the natural scaling 
between the spectra. Although the fraction of objects within the 
CALIFA mother sample with SDSS spectra is -60%, for the par- 
ticular dataset currently observed there were only 3 objects out 
of 21 with published SDSS spectra. The resolution of the SDSS 
spectra was degraded to match that of the V500 CALIFA data. 
Finally, a constant scaling factor was applied to the SDSS spec- 
tra to compensate for the diff'erences in apertures. 
Figure 



15 



shows the comparison between the aperture- 
integrated spectra and the corresponding SDSS spectra for these 
three objects, including the residual after the subtraction of one 
from the other. Both spectra agree to within -16% (rms of the 
residual) over the full wavelength range. Although the compar- 
ison sample is too small to derive statistical results, it is clear 
that the stronger diff'erences are found at the wavelengths of the 
emission lines (e.g., NGC 6394). These diff'erences are mostly 
due to a non-perfect matching of the spectral resolutions (which 
change along the wavelength in both cases), rather than to a real 
diff'erence in the intensity of each emission line (as we will show 
later). A slight diff'erence is also found in the blue-to-red spectral 
shape of UGC11649, which is clearly seen in the distribution of 
the residual spectra, although both spectra match within the ex- 
pected errors (taking into account our blue-to-red spectrophoto- 
metric accuracy). 

In summary, these preliminary quality checks show that: 
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Fig. 17. Comparison between the properties derived for the stellar populations on the basis of the analysis of the stellar indices and 
the results of the multi-SSP fitting procedure, for the central and integrated spectra. From from left to right and top to bottom, each 
panel shows the comparison of the [MgFe] index, the D4000 parameter, the equivalent width of H^, the luminosity- weighted age, 
the luminosity- weighted metallicities, and the dust attenuation of the stellar component. The orange squares indicate the parameters 
for the early-type galaxies (E or SO ones), while the blue circles indicate the parameters for the remaining galaxies. 
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Fig. 18. Comparison between the properties derived for the stellar populations on the basis of the multi-SSP fitting procedure (Age 
and Metallicity) and the corresponding properties derived on the basis of the analysis of the stellar indices (D4000, EW(H^) and 
[MgFe]), for the integrated spectra. The symbols are defined in Fig. [17 



- The accuracy of the wavelength calibration corresponds to 
~8 km s-i for the V500 (~3 km s'^ for the V1200), along 
the covered wavelength range. 

- The final spectral resolution in FWHM is ~6.5A, ie., cr ~150 
km s"^ for the V500 (~2.7A ie., cr -85 km s"^ for the 
VI 200). It has been homogenized across the FoV and along 
the wavelength range, taking into account the spectral and 
spatial variations. 



- The sky subtraction has a typical residual of -2% of the orig- 
inal flux of the night- sky spectrum. 

- The flux calibration is anchored to the SDSS photometry. 
The final photometry is accurate to -8% with respect to 
SDSS, with variation across the FoV of -0.13 mag/arcsec^. 

- The datacubes reach a Scr S/N depth at an average surface 
brightness of -23.0 mag/arcsec^ for the V500 data (-22.7 
mag/arcsec^ for the V1200 data), with -50-90% of the FoV 
above this S/N limit (with an average of -70% of the FoV 
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covered). It is estimated that -90% of the survey datacubes 
will reach this depth. 

In summary, the data have the quality foreseen in the pro- 
posal which will allow us to study the spatial distribution of 
different spectroscopic properties, including: (1) the ionized gas 
emission lines from [OII]/13727 to [SII]/16731, (2) the stellar 
population features from D4000 to Fe5335 and (3) the gas and 
stellar kinematics, with a resolution of -85 km s"^ . We conclude 
that we are on track for providing high-quality, well-calibrated 
and well-characterized reduced data to the users of the CALIFA 
survey. 

7. Verification of scientific usability 

In this section we describe a showcase analysis performed on 
the CALIFA data available to date, with the double purpose of 
verifying that we will be able to reach our science goals and of 
illustrating the information contained in the data. It is beyond 
the scope of this article to perform a detailed analysis of these 
data to achieve any of the goals of the survey, which in any case 
require a statistical sample of objects to be observed. 



ages, metallicities and stellar dust attenuation, together with the 
systemic velocity and velocity dispersion), and the properties of 
the analyzed emission lines (intensity, velocity and velocity dis- 
persion). No detailed comparison between the different available 
tools or a detailed error analysis is considered here. A simple 
SSP template grid was adopted, consisting of three ages (0.09, 
LOO and 17.78 Gyr) and two metallicities (Z ~ 0.0004 and 0.03). 
The models were extracted from the SSP template library pro- 
vided by the MILES project ( [Vazdekis et al.|20 10). This library 
is most probably too simple to describe in detail the nature of all 
the stellar populations included in the current dataset. However, 
it covers the space of possible stellar populations, and it allows 
us to obtain reliable information on the ionized gas. This fit- 
ting procedure has been adopted to analyze different aperture- 
extracted and single spectra within the datacubes, as we will de- 
scribe in the following sections. 



Figure 16 illustrates the results of the fitting procedure for 
the six galaxies also shown in Figures |7] and 15 In each panel 
the solid black line represents the integrated spectrum of the cor- 
responding galaxy (i.e., 30'' diameter aperture spectra), together 
with the best multi-SSP model for the stellar population (green 
line), and the gas emission recovered (red line). 



7.1. Continuum modeling and gas decoupling 

To extract the information contained in the datacubes, the ab- 
sorption spectra must be separated from the emissio n lines for 
each of the analyzed spectra (e.g., San chez et al.|2 011). Several 
different tools have been developed to model the underlying 
stellar pop ulation, effectively decoupling it from the emission 
lines (e.g., 'Cappellari & Emsellem"2004* 'Cid Fernandes et al.' 
2005; Ocvirk et al. 2006; Sarzi et al. 2006; Walcher et al. 2006; 
Sanchez et al. 2007a; Koleva et al. 2009; Mac Arthur et al. 2009; 
iSanchez et al.,2011) . Most of these tools are based on the same 
principles. It is assumed that the stellar emission is the result 
of the combination of different (or a single) single-stellar pop- 
ulations (SSP), and/or the result of a particular star-formation 
history. The stellar continuum is redshifted by a certain systemic 
velocity, broadened and smoothed due a certain velocity disper- 
sion and attenuated due to a certain dust content. Once the stellar 
continuum is determined, it is subtracted from each spectrum. 
This provides a pure emission line spectrum, including the in- 
formation of the ionized gas. The intensity, systemic velocity 
and velocity dispersion of each emission line can be derived by, 
for example, fitting with a set of Gaussian functions. In general 
it is known that the information derived from emission lines is 
much more accurate and stable than that derived for stellar pop- 
ulations (see [Walcher et al.|201 1[ for a review of the state of the 
art). However, an accurate decontamination of the stellar con- 
tinuum is required to measure emission line ratios, such as the 
[0III]/H/5 ratio, which allow us to interpret the data in terms of 
physical processes e.g. ionization source. 

It is beyond the scope of this article to analyze in detail 
the nature of stellar populations in the considered objects, since 
this will be one of the major goals of the full CALIFA project. 
Therefore, for the current demonstration we perform a simple 
modelling of the continuum emission. We use the routines de- 
scribed in Sanchez et al. ( 2011| ) and Rosales-Qrtega et al. ( |2010 ). 
These routines fit the underlying stellar population combining 
linearly a set of stellar templates within a multi-SSP model. They 
provide us with a number of parameters describing the physical 
components of the stellar populations (e.g., luminosity- weighted 



7.2. Comparing the results derived by CALIFA and SDSS 

The SDSS has become a well-appreciated standard for spectro- 
scopic data and also has been used to select the mother sample 
of the CALIFA survey. It is therefore of interest to make a quan- 
titative comparison of the information provided by both datasets. 
To this end we analyze both spect ra (s ee Section |6.6| ) using the 
fitting algorithm described in Sec. 7.1 Table [2] summarizes the 
results of this analysis. For each galaxy and each spectral dataset 
it lists the luminosity-weighted age, metallicity, dust attenuation 
and velocity dispersion, derived from the analysis of the stellar 
population. It also lists diagnostic emission line ratios, such as 
Ha/HyS, [OIII]/HyS, [NII]/Ha and [S II] 67 17/6731 line ratios. The 
parameters describing the stellar populations match reasonably 
well, considering that a typical e rror of -20-30% is expected in 
all of them ( ,Sanchez et al.|20lT ). The properties of the ionized 
gas match each other within the reported errors. 

Despite the small number of compared spectra, the good 
agreement in all derived spectroscopic parameters indicates that 
the current data provided by CALIFA have, on average and per 
fiber, a similar information content to that provided by the SDSS. 
The major advantage of CALIFA is the spatial coverage of the 
spectral information. 



7.3. Central vs. total spectra 

One of the science goals of CALIFA is to allow the extraction 
of observed spectra of particularly interesting regions or the full 
galaxy to produce integrated spectra using the IFU as an adaptive 
aperture spectrograp h. This technique has been used by Rosales-] 



http://www.caha.es/CALIFA/Accepted_Proposal.pdf 



Ortega et^ ( 2010| to produce aperture selected spect ra of HII 
regions within the galaxies of the PINGS survey, or in |Sanchez| 
|et al.| ( |2011| ) to derive the integrated spectrum of the complete 
galaxies. 

The full-aperture integrated spectra can be used to derive, 
for the first time, the real integrated spectroscopic properties of 
these galaxies, as opposed to previous studies that attempted to 
describe average properties by the analysis of individual spectra 
taken in different regions. The m ost similar approach would be 
the drift- scanning technique (e.g. Moustakas & Kennicutt|2006 
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Fig. 19. Comparison between the gas parameters derived for the central and integrated spectra using the fitting procedure described 
in the text. From from left to right and top to bottom each panel shows the comparison between the Ha flux, and the YLa/UjS, 
[OII]^3727/HyS, [SII]^6717/[SII]^6731, [NII]^6584/Ha and [OIII]^5007/HyS line ratios. Only values with a S/N>3 for both spectra 
have been included in each plot. 



and part of the ancillary data of the SINGS survey), although in 
those studies the fraction of the galaxy covered by the spectra 
was smaller than that of CALIFA. Another advantage of an IFU 
with respect to the drift-scan technique is that the former allows 
a comparison between the integrated and the spatially resolved 
properties of the galaxy, or the integrated spectra of particular 
regions. 

In this section we analyze two spectra derived for each 
galaxy for difl'erent apertures, using the V500-grating dataset. 
One is a 5'' diameter aperture spectrum, centered on the peak 
intensity in the V-band. This spectrum is representative of the 
central region of each galaxy, and is sim ilar to the one obtained 
by the SDSS survey (see Section 6.6). The other is a 30'' di- 
ameter aperture spectrum, an aperture that contains most fibers 
above the 5cr detection limit for each galaxy. The latter spectrum 
is representative of the integrated properties of each galaxy. Both 
spectra have similar S/N (-40, at -5000 A). This is due to the 
fact that the central spectrum samples areas of much higher S/N, 
despite its smaller aperture, while the total spectrum samples ar- 
eas of lower S/N, but with a larger aperture. On average, the 
5'' aperture contains 10% of the flux encircled by the 30'' aper- 



7.1 



ture. We applied the above fitting technique described in Sec 
to both spectra, deriving the main properties of both the stellar 
population and the ionized gas. An example of typical integrated 
spectra, and their modelling, was already shown in Fig. [16] 



7.3.1. Stellar Populations 

Gradients and, more generally, spatial variations of stellar popu- 
lation properties have been observed in galaxies, both early types 



(e.g. |Kuntschner et al.|2006| and late types (e.g [Mac Arthur et al 
2004|). CALIFA will allow the mapping of these variations for 
the first time over a statistically representative sample of galax- 
ies, unbiased in terms of morphology and mass/luminosity. The 
detailed modeling required to this goal is beyond the scope of 
this paper and will be presented in forthcoming dedicated works. 
Here we just illustrate the potential of the dataset using either 
model-independent measurements, such as the spectral indices, 
or the output of the simple fitting technique described in Sec. 

7.1. 

It is known (e.g. |Trager et al.|2000[[Ganazzi et al.|2005| ) that 
spectral indices can be used to infer stellar population parame- 
ters such as age, metallicity and a enhancement. They provide 
robust, model-independent, information, complementary to that 
provided by fitting the full spectrum with SSP templates as de- 
scribed in Sec. 7J_ For the current analysis, we have explored the 
D4000 index (i.e. the 4000 A break) and the index (which 
represents the equivalent width of the line), as tracers of stel- 
lar age, and [MgFe] as a tracer of stellar metallicity. For this later 
index, we adopted the formula: 

[MgFe] = ^/Mgb (0.72Fe 5270 + 0.28Fe5335) 

The indices were measured for both the central and inte- 
grated spectra, once decontaminated by the ionized gas emission 
and normalized to the standard Lick/IDS resolution, using the 
scripts included in FIT3D, described in Sanchez et al.| ( 2007b). 

Figure 17 shows a comparison of the properties derived for 
the underlying stellar populations for both spectral apertures. It 
includes the results from the analysis of the indices, i.e., the age- 
sensitive D4000 and indices and the metal-sensitive [MgFe] 
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Table 3. Summary of the physical properties derived from the analysis of the ionized gas for the integrated and central spectrum of 
each galaxy. 



Integrated spectrum 



NAME 


Spec. Type 


Ay 


SFR/zq, 


SFR[(9//] 


log(U) 


12+log(0/H) 


Electron Density 






(mag) 


M© yr"^ 


M© yr"^ 






cm~^ 


NGC 5947 


SF 


1.2 + 0.1 


3.8 + 0.1 


3.8 + 0.8 


-3.5 + 0.2 


8.7 + 0.1 




UGC 09892 


SF 


1.3 + 0.2 


2.3 + 0.1 


3.9 + 0.8 


-3.7 + 0.4 


8.7 + 0.3 




NGC 6394 


SF/AGN 


2.1 +0.2 


12.1 +0.1 


14.1 +2.8 


-3.2 + 0.3 


8.6 + 0.1 


214 + 64 


NGC 6497 


SF/AGN 


1.5 + 0.2 


3.7 + 0.1 


3.4 + 0.7 


-3.5 + 0.4 


8.7 + 0.1 




UGC 11262 


SF 


1.5 + 0.2 


1.8 + 0.1 


1.4 + 0.3 


-3.3 + 0.7 


8.6 + 0.2 


145 + 43 


NGC 6762 


AGN 


1.5 + 0.2 












UGC 11680 


AGN 


3.3 + 0.2 






-3.0 + 0.5 






UGC 11694 


SF 


0.0 + 0.5 


0.3 + 0.1 


0.2 + 0.1 


-2.9 + 0.7 


8.6 + 0.4 




UGC 11717 


AGN 


3.3 + 0.4 






-3.6 + 0.2 




4571+ 1371 


UGC 11740 


SF 


1.8 + 0.2 


3.5 + 0.1 


5.1 + 1.0 


-3.5 + 0.5 


8.6 + 0.2 


525 + 157 


UGC 12185 


SF/AGN 


2.8 + 0.4 


6.2 + 0.1 


22.1 +15.0 


-3.6 + 0.6 


8.6 + 0.2 




NGC 7549 


SF 


2.4 + 0.1 


10.8 + 0.1 


10.3 + 0.1 


-3.7 + 0.2 


8.7 + 0.1 




Central Spectrum 


NAME 


Spec. Type 


Ay 


SFR//Q, 


SFR[<9//] 


log(U) 


12+log(0/H) 


Electron Density 






(mag) 


Mo yr"^ 


Mo yr"^ 






cm~^ 


NGC 5947 


SF/AGN 


1.5 + 0.1 


0.4 + 0.2 


0.2 + 0.1 


-3.2 + 0.3 


8.7 + 0.1 




UGC 09892 


SF/AGN 


2.6 + 0.3 


0.2 + 0.2 


0.4 + 0.2 


-3.8 + 0.6 


8.7 + 0.3 


2512+ 754 


UGC 10710 


SF/AGN 


4.5 + 0.9 


3.9 + 0.1 


23.2+13.9 


-3.8 + 0.6 


8.6 + 0.5 




NGC 6394 


AGN 


2.9 + 0.3 






-2.8 + 0.1 




309 + 93 


NGC 6497 


AGN 


1.8 + 0.9 






-3.4 + 0.4 






UGC 11262 


SF/AGN 


2.7 + 0.4 


0.1 +0.2 


0.2 + 0.2 


-3.7 + 1.1 


8.7 + 0.4 


575 + 173 


NGC 6762 


AGN 


2.4 + 0.5 






-3.3 + 0.1 




234 + 70 


UGC 11680 


AGN 


2.3 + 0.2 






-2.5 + 0.1 






UGC 11694 


AGN 


3.1 + 0.5 






-3.3 + 0.2 




316 + 95 


UGC 11717 


AGN 


4.9 + 0.7 






-3.4 + 0.1 






UGC 11740 


SF/AGN 


2.8 + 0.6 


0.1 +0.2 


0.5 + 0.2 


-3.7 + 0.5 


8.6 + 0.3 


34 + 10 


NGC 7549 


SF/AGN 


2.6 + 0.2 


0.4 + 0.2 


0.3 + 0.2 


-3.4 + 0.3 


8.6 + 0.2 





index, and the physical parameters luminosity-weighted age, 
metallicity and dust attenuation, estimated by fitting SSP tem- 
plates to the full spectrum. The galaxy UGC11262 has been ex- 
cluded due to its lower S/N level in the continuum due to tech- 
nical problems during the observations. These problems aff'ect 
the shape of the continuum and therefore the parameters derived 
by the fitting procedure, although they aff'ect neither the stellar 
indices nor the parameters of the emission lines. 

Fig. [17] shows the correlations between central (within 5 arc- 
sec) and total integrated indices. Red dashed lines are used to 
display the one-to-one correlation and guide the eye. For all in- 
dices we find tight correlations between central and total val- 
ues. However, we find systematic off'sets with respect to the one- 
to-one line, which indicate gradients: in particular, for the stel- 
lar population indices, i) [MgFe] central is always significantly 
higher than total, thus indicating negative metallicity gradients in 
all galaxies; ii) US is typically stronger in the integrated spectra 
than in the central ones, thus indicating that young stellar popu- 
lations are mainly found in the outskirts of galaxies; Hi) D4000 
has typically flatter radial distribution, with some indication for 
positive (age) gradients for the oldest galaxies (those with the 
strongest D4000), and vice versa for the younger ones. 

Similar trends are appreciated in the parameters derived from 
the SSP fitting procedure. On average the central stellar popula- 
tions are slightly older than the integrated ones, and they suff'er 
a stronger or equal dust attenuation. The metallicity shows no 
clear trend, with values randomly distributed around the one- 
to-one relation. Maybe the selected library template, with only 
two extreme metallicities in the grid, is not good enough to 



sample this parameter properly, which is better represented by 
the [MgFe] index. Diff'erences are also appreciated among the 
diff'erent galaxy types. The stellar component of the early-type 
galaxies (E or SO) is, in all the cases, dominated by old (>7 Gyr, 
high D4000 and low values) and metal rich (Z >0.025, hight 
[MgFe] values) populations. In addition, very low dust attenua- 
tion is found in both the integrated and central spectra (Ay <0.5 
mag, with a mean value of -0.2 mag). On the other hand, the 
late-type galaxies show a wider variety of properties in their stel- 
lar populations. 

The information provided by the indices and the SSP anal- 
ysis is complementary, but leads to similar conclusions, in gen- 
eral. Figure [18] shows a comparison between each stellar popu- 
lation index and its corresponding parameter derived by the SSP 
analysis, for the integrated spectra (similar results are found for 
the central spectra). As expected there is a clear correlation be- 
tween the age-sensitive indices (D4000 and H^) and the lumi- 
nosity weighted ages, on one hand, and a weaker trend between 
the [MgFe] and the luminosity weighted metallicity, on the other 
hand. The larger scatter in this relation is most probably due to 
an incorrect sampling of the metallicities in the over- simplistic 
stellar population library grid adopted for the current analysis. 

Although a more detailed analysis is required to understand 
the described trends, these preliminary results seem very encour- 
aging, illustrating the kind of studies that can be done when ap- 
plying more refined analysis techniques over the full CALIFA 
sample. 
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Fig. 20. [OIII]/15007/HyS vs. [NII]/16583/Ha diagnostic diagram for integrated (left panel) and central (right panel) spectra of the 
currently observed CALIFA sample. Upper limits are indicated with an arrow, if only one of the two ratios could be accurately 
determined. We also plot data derived from the SDSS spectra of the CALIFA mother sample (orange open squares). The Kauffmann 
et al. (2003) (blue dashed-line) and Kewley et al. (2001) (red solid-line) demarcation curves are usually invoked to distinguish 
between SF galaxies and AGN with an intermediate region in between. 



7.3.2. Ionized gas properties 



Figure 19 shows a comparison between the emission line fluxes 
and ratios of the ionized gas in the integrated and central spec- 
tra, derived using the previously described fitting procedure 
(Sec. 7.1). The figure includes the observed flux intensity of 
the Ha emission line, and difl'erent line ratios between some 
of the strongest emission lines, including Ha/HyS, [OIIJ/Hof, 
[SII]6717/6731, [NII]/Ha and [OIII]/HyS. None of the plotted 
values have been corrected for dust attenuation, and only those 
with a S/N>3 have been included in each plot. Ha gas emission 
is detected in the integrated spectra of 15 galaxies, and in the 
central spectra of 16. As expected, most of the galaxies without 
detected ionized gas emission (i.e. ¥(Ha) < 3cr), either in the 
central or in the integrated spectra are early-type galaxies (E and 
SO). 

The comparison illustrates the importance of mapping the 
full optical size of galaxies to derive the average properties of the 
ionized gas. For example, the Ha intensity, as an additive prop- 
erty, is much larger in the integrated spectra than in the central 
aperture. The difl'erences in other non-additive properties (e.g., 
the Ha/UjS line ratio), illustrate changes in the physical condi- 
tions of the gas from the inner to the outer regions. 

The plotted emission line ratios can be used to distinguish the 
ionizing source of the gas associated with each region. In partic- 
ular, the [OIII]/HyS and NII/Ha can be used as a diagnostic probe 
which is almost insensitive to the dust attenuation. Figure [20 
shows the classical BPT diagram (Baldwin et al.|1981[ Veilleux 
& Qsterbrock|1987 ) for both the integrated (left-panel) and cen- 
tral (right-panel) spectra for the galaxies with detected ionized 
gas. We included in the figure the corresponding line ratios esti- 
mated from the 496 SDSS spectra of the CALIFA mother sam- 
ple with clearly detected ionized gas. The diagram shows the 
typical butterfly structure, with a left branch corresponding to 
HII regions powered by the ionizing radiation of young OB stars 
(i.e., star forming regions), while the branch on the right side is 
usually attributed to ionization by an AGN. We include the the- 
oretical boundary between ionization from stars (SF) and from 



an AGN from Kewley et al.' (2001) as well as the one empiri- 
cally derived by Kaufl^ mann et al. (2003). Objects are thus clas- 
sified as star forming (SF), AGN-ionized (AGN), or intermedi- 
ate (SF/AGN) if they lie between the empirical boundary and the 
theoretical one. Table |3] includes the result of this classification. 
Note that the current classification scheme has not considered 
the possibility of ionizat ion due to post-AGB stars as suggested 
by several authors (e.g., Trinchieri & di Serego Alighieri|199l[ 



"Bin ette et ai:][T994l [Stasinska et al.||20081 |Cid Fernandes et al.| 



2010t ISarzi et al.||2010| ). It is beyond the scope of the current 
study to analyze that possibility in detail, although it may lead to 
the reclassification of the ionized sources presented here. 

For the integrated spectra with both line ratios detected (+up- 
per limits) we find that 6(-hl) galaxies are classified as SF, 3(-hl) 
as AGN and 3(-\-l) as intermediate. On the other hand, based 
on the properties of the central spectra, 6(-\-l) galaxies are clas- 
sified as pure AGN and 6(-h3) as intermediate. The fraction of 
AGN detected in this first reduced sub- sample observed so far 
by CALIFA is much larger than the one expected on the basis 
of the analysis of the SDSS spectra (i.e. ~l/3 of the objects), 
as is clearly seen in the figure. Targets were selected from the 
mother sample purely based on observability. Furthermore, we 
have so far not found significant diff'erences between the proper- 
ties of the SDSS and CALIFA central spectra. Therefore, most 
probably this eff'ect is a result of small number statistics. 

That the fraction of galaxies classified as SF is larger for 
integrated spectra than for central spectra is of course expected 
as an aperture eff'ect. It is one of the design goals of CALIFA to 
quantify this eff'ect. 

In addition to the determination of the main ionization 
source, the emission line spectrum can be used to derive proper- 
ties of the ionized gas. The dust attenuation (Ay) of emission 
lines along the line of sight through a galaxy can be derived 
from the B aimer decrement, assuming an extinction law (e.g. 
Cardelli et al.|1989j ) and comparing the theoretical vs. observed 
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Fig. 21. Color magnitude diagram of the galaxies currently observed within CALIFA, for the SDSS u - r observed colors versus 
the r-band absolute magnitude. For each galaxy we plot in color a map for the continuum intensity at -6550 A down to a surface 
brightness of ~23 mag/arcsec^ AB (>3cr detection limit per spaxel), together with a contour plot of the Ha emission (if detected), 
derived by the fitting procedure described in the text (applied to the V500 data). The contours start at 0.3 10"^^ erg s"^ cm"^ arcsec"^, 
and are spaced on a logarithmic scale, following the equation / = 0.3 + N^-^ (0.3,1.3,3.1,5.5...). The NED name of the galaxies has 
been included, together with the orientation of the map on the sky. Note that some images have been shifted by up to ±0.5 mag 
around the nominal color-magnitude coordinate to avoid overlap. Nearby (projected) companions (e.g., NGC 7025) and foreground 
stars have not been masked (e.g., UGC 11694, UGC 11717). 



l(tla)/l(Ufi) ratio. Table |3] lists the derived dust attenuation for 
each object, for both the integrated and central spectr^ 

Extinction-corrected line fluxes can be used to derive the star 
formation rate (SFR), based either on the Ha or [Oil] 3727 line 
intensities. The SFRs can be derived adopting the classical re- 
lation between this parameter and the luminosity of both emis- 
sion lines ( |Kennicutt|1998| ). Note that the absolute luminosities 
at the distances involved in CALIFA are insensitive to the as- 
sumed cosmology 

Table [3] includes the results of this calculation for those 
galaxies in which the ionization is not dominated by an AGN. In 
most of the cases the SFRs derived using both lines are consis- 



^ Assuming Ry = 3.1 and case B r ecombination: temperature 10"^ K, 
density 10^ cm'^ ( |Osterbrock|l989| 
^ A Ho =70 km s Mpc~^ and qo=0.5 was adopted. 



tent within the errors. In a few cases (-10%), a much higher SFR 
is found from [Oil] than using Ua (e.g. UGC12185 for the in- 
tegrated spectrum, and UGC 107 10 for the central one). In these 
cases the observed [OII]/Hck ratio does not seem to be abnor- 
mally high. An inspection of these spectra shows that for such 
cases the correction of UJ3 by the underlying stellar absorption is 
particularly high, which may afl'ect the derivation of the dust at- 
tenuation and most probably produces an over-correction of the 
[Oil] flux. Galaxies for which the [OII]/HQf ratio is higher than 
the average are located in the intermediate region of the BTP di- 
agram, between the pure SF and pure AGN ionization regimes. 

The emission lines present within the spectral range of 
CALIFA allow us to examine other properties of the ionized gas, 
such as the hardness of the ionization, typically characterized 
by the ionization parameter U, defined as the ratio of the den- 
sity of ionizing photons to the particle density. This parameter 
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Fig. 22. Color maps showing the distribution of the emission line ratios between [NII]/16583 and Ha, in a color-magnitude diagram 
similar to that of Figure [21] In the present case the solid contours show the intensity of the Ha emission, adopting the same levels 
as in Figure 21 The dashed blue contours show 3 intensity levels of the continuum emission at -6550 A (starting at 3 10"^^ 
erg s"^ cm"^ arcsec"^). They have been included to indicate the physical extent of the continuum emission in the galaxies. 



is best determined using the ratios of emission lines of the same 
element originating from different ionization stages. The lines 
available in our spectra allow us to derive the ionization parame- 
ter from the ratio [O ii]/[0 iii]=^3727/(/l4959+/l5QQ7 ), using the 
relation log u = -0.80 log( [0/7] /[O///]) - 3.02, after toiaz et a T] 
( |2000| . The derived ionization parameters are listed in Table [3] 
The reported values are in the higher range co mpared to most 
of the known HII regions (e.g. the Orion Nebula, [Sanchez et"aLl 
2007c). The main reason for these high values is most probably 
the presence of an AGN in a substantial fraction of the galaxies 
of the current sample. 

Gas-phase oxygen abundances can also be obtained from 
the emission lines present in the CALIFA data. However, a di- 
rect oxygen abundance determination requires the presence of 
temperature- sensitive auroral lines such as [Oiii]4363 A and/or 
[Nii]5755 A, which are only present in the low-to-intermediate 
metallicity regime and are generally much fainter than the corre- 
sponding nebular ones, our spectra are not deep enough to de- 
tect them, in most of the cases. Nevertheless, we can turn to 



strong-line methods to derive the gas metallicity (e.g., Kewley 



& Ellison 2008} Lopez- Sa nchez & Esteban"2010), bearing in 
mind that these calibrations are only valid for gas ionized by 
star formation. In the case of hard ionization due to AGN, the 
derivation of the oxygen abundance is largely unexplored, and 
only detailed photo-ionization models can be used to quantita- 
tively estimate it (e.g. Storchi-Bergman n et al.| |1998 ). For the 
purpose of the current exploratory study, we restrict ourselves to 
objects which are not AGN-dominated and we adopt the sim- 
ple 03N2 indicator Pettini ^XPagell ( [20041 ), 12 -h log(OH) = 
8.73 -h 0.32 logio %%^A65S3/^ • derived abundances are listed 
in Table |3] They are in the range of values expected for galaxies 
of this kind. No significant variation is found between the values 
reported for the central and integrated spectrum. 



Table [3]lists also the electron density of the ionized gas from 
the [S ii]6717 A/6731 A doublet ratio. This particular line ratio 
is sensitive to the electron density only for a particular range 
of values ( |Qsterbro ck 1989). Only electron densities within this 
range are listed in Table |3] A wide range of values is found for 
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Fig. 23. [OIII]/15007/HyS vs. [NII]/16583/Ha diagnostic diagram for the individual spectra within the V500 datacubes corresponding 
to two different objects, UGC 11680 (left panel) and NGC 7549 (right panel) in the currently observed CALIFA sample. The solid 
and dashed-lines are similar to those shown in Figure 20 The size and colors of the symbols indicate the distance to the center of 
the galaxy, with bluer and smaller circles located in the inner regions and orange and larger ones located in the outer parts. 



the electron densities, indicating different gas conditions for the 
different objects and spatial regions. 

We conclude from this section that the emission line spec- 
tra delivered by CALIFA live up to the expectations in terms of 
their scientific usability. Future analyses should be performed, 
improving the subtraction of the underlying stellar population, 
the treatment of dust attenuation and the derivation of the oxy- 
gen abundance. 

7.4. Spatial distribution of the spectroscopic properties 

In the previous section we have described the results of the anal- 
ysis of single spectra, selected from two different apertures. A 
further design goal of CALIFA is, however, the analysis and 
comparison of the spatial distribution of those properties. We 
now turn towards showing the power of CALIFA in this domain. 

We have applied the fitting procedure described in previous 
sections to each individual spectrum of each datacube, for both 
the V500 and VI 200 datasets. For each spaxel of each datacube 
we thus recover the corresponding spectral information, and it 
is therefore possible to create two-dimensional maps for each of 
the derived parameters that describe the stellar populations (lu- 
minosity weighted ages, metallicities, dust attenuation, and/or 
coefficients of the mix of the adopted SSP templates), and the 
ionized gas (line intensities, dust attenuation, diagnostic line ra- 
tios, ionization strength, oxygen abundance, electron density). In 
addition, the two-dimensional kinematic structure for both the 
ionized gas and the stellar populations are derived. We will il- 
lustrate the results of this analysis in the next figures. 

Figure 21_ shows the color-magnitude diagram for the galax- 
ies of the analyzed subsample, based on the SDSS u-r observed 
colors and the r-band absolute magnitude. For each galaxy, we 
plot a color-map for the continuum intensity at -6550 A down 
to a surface brightness of -23 mag/arcsec^ AB (>3cr detec- 
tion limit per spaxel), together with a contour plot of the Ha 
emission (if detected). The names of the galaxies have been in- 
detified. As expected, the brightest and reddest galaxies show 
the least ionized gas emission. These dry or almost dry galax- 
ies are mostly morphological early-types (E and SO). Some of 
them show some emission lines in the central regions. However, 
the detected emission in Ha is below the conservative detection 



limit adopted in Table [3] A more detailed decoupling analysis 
between the emission lines and the underlying stellar population 
is needed to ensure the reliability of this detection, in particular 
of E and SO galaxies (Kehrig et al., ). 

A greater morphological diversity and a larger range of col- 
ors is found among the galaxies with ionized gas. There are the 
typical face-on (e.g. NGC 5947, NGC 6497) and edge-on (e.g. 
UGC 10710, UGC 09892) spiral galaxies; galaxies with strong 
bars (e.g. NGC 6392); low surface brightness galaxies (e.g. UGC 
11740), and galaxies with evidence of recent interactions (e.g. 
UGC 11680 and NGC 7549). The two reddest galaxies with gas 
(UGC 11680 and UGC 11717) both harbor an AGN and have 
strong dust attenuation in their integrated spectrum (as derived 
from the ionized gas, Table[3|. The spatial distribution of the ion- 
ized gas also shows a wide diversity. In some galaxies, the ion- 
ized gas is concentrated in the central regions (e.g. NGC 6762, 
dominated by a ionization different than star formation), while 
in other objects the ionized gas follows the spiral arms (UGC 
11680, NGC 5947 or NGC 7549), or is located in a ring (UGC 
1 1649). The ionization source in most of these galaxies is purely 
star formation, based on their integrated spectra. The most dis- 
torted Ha morphology is shown by NGC 7549, which is also the 
galaxy with the highest SFR of all those dominated by pure star 
formation in the integrated spectrum. The SDSS image of this 
object shows the typical distorted morphology produced by an 
interaction and/or merging process. 

The ionization conditions change considerably not only from 
galaxy to galaxy, but also within each galaxy. Figure [22] illus- 
trates these changes, by showing the spatial distribution of the 
[NII]/16583/Ha emission line ratio, for each galaxy. In Figure [20 
it was shown how this line ratio, in combination with [OIII]/HyS, 
can be used to derive the main source of the ionization. In most 
of the ionization models, values of log([NII]/H(2)>l cannot be 
produced by star formation, even in the case of very strong star- 
bursts. Figure 22 shows that such values are only found in the 
central regions of some objects: NGC 6762, NGC 6497 and 
UGC 11649. In all these cases the ionization is clearly domi- 
nated by an AGN (Table |3]). However, while in some the ion- 
ization is concentrated in the central region (NGC 6762), in 
other cases there are clear extended emission regions most prob- 
lably ionized by the AGN (NGC 6497 and UGC 1 1649). Similar 
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Fig. 24. Color maps showing the distribution of the D4000 stellar population index in a color-magnitude diagram similar to that of 
Figure [2T 



comparative analyses between different galaxy types and within 
galaxy classes can be performed using any of the parameters that 
characterize the ionized gas, like those listed in Table [3] 



Figure[23]shows two diagnostic diagrams, similar to the ones 
presented in Fig. |20| corresponding to the individual spectra 
within the V500 datacubes of two objects, UGC 1 1680 and NGC 
7549. These two diagrams illustrate how the ionization source 
changes with location, as already illustrated in Fig. [22] The 
track along the diagnostic diagram due to pixel-to-pixel varia- 
tions in a single galaxy is an i mportant tool to understand th e 
nature of the ionization (e.g.. Sharp & Bland-Hawthorn 2010). 
In the case of UGC 11680, there are regions corresponding to 
all three regimes, dominated by star- formation, intermediate and 
dominated by an AGN. On the other hand, the gas ionization in 
NGC 7549 is clearly dominated by star-formation, with a tail of 
-10% of the spectra showing possible intermediate ionization 
(e.g., |Alonso-Herrero et a l. 2010 ). In future studies these dia- 
grams will be used to separate the different ionization sources 
in the spatial domain, trying to correlate them with other local 
properties of the galaxy. 



Figure [24] shows maps of the D4000 stellar population in- 
dex, sensitive to the age of the stellar component (e.g.. Fig [18]), 



in a similar color-magnitude diagram to Fig. [21] Radial gradients 
are readily observed, which in some cases highlight abrupt tran- 
sitions between different structural components of the galaxies 
(e.g. bulge, bar, disc). As expected, the morphologically early- 
type galaxies (E/SO), bright, red and dry, are dominated by old 
stellar populations (D4000>2), with little variation within their 
optical extent. Late-type galaxies, on the other hand, show a 
wider variations of the age of the stellar population, both from 
galaxy to galaxy and within the same galaxy. In most cases there 
is a gradient in the ages, such that the central region is older than 
the outer parts. This is most evident for the face-on spirals. 

A more detail analysis of the variation of the stellar pop- 
ulations across the field requires the comparison of either the 
distribution of the luminosity-weighted ages and metallicities 
or the corresponding indices. Figure 25 shows the distribution 
of the metallicity-sensitive [MgFe] index and the age-sensitive 
D4000 index, derived spaxel-to-spaxel across the field of view 
of the two galaxies shown in Fig. [23] Only those spaxels with 
an intensity above 3 10"^^ erg s"^ cm"^ arcsec"^ (S/N~10) at 
-5000 A have been included in the plots. Both figures illustrate 
in more detail the change in the stellar populations from the in- 
ner to the outer regions. 
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Fig. 25. Distribution of the metallicity sensitive [MgFe] stellar index and age sensitive D4000 index across the field of view for the 
two galaxies shown in Figure 23 The colors and sizes of the circles indicate the distance from the center, defined as the intensity 
peak in the V-band. The bluer/smaller circles indicate the central regions and the orange/larger symbols the outer ones. The figure 



illustrates in more detail the change in the stellar populations described in Figure 24 



It is interesting to note that the stellar populations of NGC 
7549 show a clear gradient, with the inner regions dominated 
by old and metal rich stars, and the outer ones dominated by 
younger and metal poor ones. This is the typical trend expected 
in an inside-out secular evolution of a disk-dominated galaxy. On 
the other hand, this gradient is less evident in UGC 11 680, where 
there is a more diverse mix of stellar populations at all radii. 
While both galaxies are face-on spirals with evidence of a recent 
interaction, the gas ionization is dominated by star-formation in 
most of the extension of NGC 7549, while UGC 11680 hosts 
an AGN. Whether this is related to the observed diff'erences in 
the stellar populations is one of the main goals of the CALIFA 
survey, i.e., to study the inter-relation of the spatially resolved 
spectroscopic properties of diff'erent types of galaxies. 

The results presented in this subsection illustrate the poten- 
tial of the CALIFA data to study the spatially resolved properties 
of the stellar population of galaxies. We will analyze the implica- 
tion of these results on our current understanding of the evolution 
of galaxies in forthcoming articles. 

7.5. Kinematic maps 

The stellar kinematics were derived from the VI 200 dataset in 
the wavelength range between 3850 and 4600 A throu gh th e stel- 
lar population fitting procedure described in Section |7.1| Only 
the VI 200 datacubes with a 3cr surface brightness detection limit 
above 22 mag/arcsec^ are used. For the remaining objects, we 
performed a similar analysis from the corresponding V500 data, 
restricted to a similar wavelength range. The accuracy in the 
derivation of the stellar kinematics is expected to be better for 
the VI 200 data than for the V500, in particular in the analysis of 
the velocity dispersion. 



Figures |26] and [27] show the velocity maps derived for the 
stellar population and the ionized gas, respectively, for each in- 
dividual galaxy within the current dataset. These velocity maps 
were derived based on spaxel-to-spaxel analysis, without adopt- 
ing any optimal binning. A proper binning will increase the ac- 
curacy of the kinematic analysis, as described below. The veloc- 
ity maps are labelled with the corresponding grating name of the 
data used to derive them. Similar diagrams can be constructed 
for the velocity dispersion maps. Combined these diagrams al- 
low us to classify the galaxies in terms of ordered vs. random 



motions by identifying non-regular motions and disturbances. 
For the gas kinematics we have chosen here to present the Ha 
kinematics, since this emission line is one of the strongest. No 
significant diff'erences are found when using other strong emis- 
sion lines. 

Obviously, the gas kinematics can only be derived in those 
regions (or objects) with detected emission lines. For those 
galaxies, mostly late-type objects, the kinematics of both the gas 
and the stars are dominated by rotation, with an amplitude of 
±150 km s"^ There are no significant diff'erences between the 
gas and stellar kinematics, at least for the objects observed so 
far, for those galaxies for which the stellar and gas kinemat- 
ics is sampled in the same physical regions: i.e., regions with 
high S/N ionized gas and stellar continuum. In a few cases (e.g., 
NGC6497 and UGC 1164), we detect high S/N ionized gas in 
regions with low intensity continuum. For those galaxies we do 
not have information of the stellar kinematics derived spaxel- 
to-spaxel in those regions where we have derived the ionized 
gas one. In these cases the gas kinematics fits well with the ex- 
pected values from the extrapolation of the stellar kinematics to 
these outer regions. NGC 7025 is a special case because it is 
the only morphological late galaxy without detected gas emis- 
sion (so far). This galaxy, apparently dry, bright and red, shows 
a clear rotational pattern, consistent with its morphological clas- 
sification. 

All galaxies without rotation are morphologically elliptical 
galaxies: UGC 12127, UGC 10693, NGC 6411 and NGC 6515 
(4 out of the 5 elliptical galaxies in the current sample). We 
have no gas kinematics for these, since no emission lines were 
detected in these objects. On the other hand, their stellar kine- 
matics show a patchy structure, consistent with a flat distribu- 
tion, without significant gradients. Although rotation may be- 
come apparent through a more detailed analysis , making use of 
spatial binning approaches, we detect none for the moment. 

On the other hand, the remaining early-type galaxies, NGC 
7194 (E), UGC 11228(SB0) and NGC 6762(S0/a), show signif- 
icant rotation. Emission lines were detected only in NGC 6762 
and only in the central region, with line ratios consistent with the 
presence of an AGN. Particularly interesting is the case of NGC 
7194, an object that does not present any morphological signa- 
ture of a disk. When compared to other galaxies classified as E, 
e.g. NGC 6515, both objects have similar morphologies, colors. 



27 



S.R Sanchez et al.: CALIFA, the Calar Alto Legacy Integral Field Area survey: 



3.5 



3.25 



c/3 3.00 

m 

Q 

CO 

^ 2.75 



2.5 - 



2.25 



T 



E V500 




B9V 





N 



S V500" 




-100 100 

Vstars (km/s) 




E V1200 S 




V5@0 « SB(rla V500 



^ >.^(r>rVl200 

SBO V1200 
E V500 E V500 * A 

^ SbVljOO ^ ^ 

^ S^(s)ab V500 
SBb V1200 





SBa V1200 




so/a V500 




Sd V500 



SBbeiyi200 




Sb V1200 




-22.5 



-22.0 



-21.5 



-21.0 



-20.5 



-20.0 



-19.5 



M,bs r SDSS 



Fig. 26. Color maps showing the velocity field of the stellar component from the V1200-grating dataset in a color-magnitude 
diagram similar to that of Figure[2T] For those galaxies for which the 3cr limiting surface brightness per spaxel was brighter than 22 
mag/arcsec^, the velocity map was derived from V500 dataset. The contours show the intensity level of the continuum emission at 
-4000 A, for the same levels as in Figure 21 



and luminosities. The continuum emission of both galaxies is 
dominated by old (>8 Gyr) and super-solar (Z -0.028) stellar 
populations, with small or no dust attenuation(Ay <0.1 mag). 
Despite all these similarities, both objects clearly show a diff'er- 
ent kinematic behavior. However, a full analysis of the velocity 
and the velocity dispersion maps is required to disentangle the 
real nature of the kinematic structure of these objects. 

We use the h igh S/N integrated and central spectra described 
above (Sec. |7.3[ S/N>40) to estimate the accuracy in the deriva- 
tion of the stellar velocity. First, each spectrum was split into 
consecutive, non overlapping wavelength ranges of 200A. Then, 
each sub- spectrum was fit with a multi-SSP, adopting the proce- 
dure described in Sec. 



7.1 



The stellar systemic velocity derived 
for each wavelength range is recorded. The typical standard de- 
viations around the central values are ~5 km for the VI 200 
data, and ~10 km for the V500 one. Taking into account that 
200A is only a short wavelength range, these values agree with 
the expectations based on the accuracy of the wavelength cali- 
bration (Sec.|6.2|). 



This estimation of the accuracy of the kinematic properties 
may be misleading, since the S/N of the individual spaxels is 
lower than that of the integrated and central spectra. To de- 
rive maps of the stellar kinematics, including the mean velocity 
and velocity dispersion we need a minimal S/N per spectrum, 
whic h requires the ad option of a spatial binning technique (e.g. 
Emsellem et al.|20Q4 ). First, we apply an a priori cut threshold 
in the signal-to-noise per spaxel of S /N >3. Next, we perform a 
Voronoi binning following ' Cappellari & Copin| ( |20 03). Diff'erent 
experiments have been performed to derive the optimal limit of 
the S/N goal in the binned data, regarding the accuracy of the 
derived parameters. For the velocity and velocity dispersion a 
compromise has been adopted between maximizing the S/N per 
bin and minimizing the size of the final bins: i.e., keeping as 
much spatial information as possible. 

Our tests indicate that to achieve a typical accuracy of ~5 
km s"^ in the mean velocity and a typical accuracy of ~15 km s"^ 
in the velocity dispersion, for the VI 200 data, a final S/N per 
bin of ~20 is required. With this S/N requirement, the typical 
bin has a projected size of a few arcseconds. Figure [28] illus- 
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Fig. 27. Color maps showing the velocity field of the ionized gas derived from the Ha emission. The contours are similar to those 
shown in Figure 22 



trates the result of these experiments, showing, for the velocity 
and velocity dispersion maps, together with the estimated errors 
for both parameters based on an extensive Monte Carlo simula- 
tion for a particular target. At ~20-3C from the center of the 
galaxy, there is a rise in the error to -30-40 km s"^ This eff'ect 
is stronger in the distribution of velocity dispersions. This is due 
to the slight decrease of the S/N per bin and the fact that in the 
outer regions the velocity dispersion is smaller (in general), and 
therefore harder to measure. This eff^ect is sh own in the simu- 
lations presented by Marmol-Queralto ( 2011| ), for the CALIFA 
feasibility studies. 

Similar results are derived for all the galaxies observed so- 
far, as expected due to the sample selection and survey strategy 
which ensures a similar S/N pattern across the field for the diff'er- 
ent targets. A more detailed report on the accuracy of the stellar 
kinematics will be described in forthcoming papers. 



7.6. Kinematics Analysis 

As indicated in the introduction one of the goals of the current 
survey is to use the stars and gas kinematics to derive physical 
properties of the galaxies, such as dynamical mass. This requires 



a detailed modelling of the velocity curves (and dispersion maps) 
of each particular galaxy, which is outwith the scope of the cur- 
rent article. To probe the quality of the current data to perform 
these proposed studies a simple modelling of the velocity is per- 
formed. We used the maps shown in Figure [26] trying to recover 
the asymptotic rotation speed (which is related to the dynamical 
mass), for the disk-galaxy NGC 5947. 

For this study the observed rotational curve was de- 
projected. De-projection requires the position of the kinematics 
axis and the inclination angle to be known. The kinematic center 
was fit by eye using the continuum and Ha intensity maps, and 
the axes were adjusted to give the most accurate rotation curve, 
i.e., symmetrical and with singular velocity at the center. The in- 
clination angle was found from the 2MASS K-band axis ratios 
assuming an infinitesimally thin circular disk. 

This deprojected rotation curve was fit with a simple (but 
robust) arctan function of the form, 

2 Vf 

v(r) = Vsys + arctan(^r - c) 

71 

where v^3;^ is the systemic velocity of the gas and is the asymp- 
totic rotation speed of the disc, s characterises the slope of v(r) 
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Fig. 28. Results from the stellar kinematics analysis. From left-top to right-bottom the binned maps corresponding to the derived 
stellar velocity, its error, the stellar velocity dispersion and its corresponding error, all in units of km s"^ Values with errors larger 
than 35 km s"^ have been masked-out, and indicated with grey-colors. 



in the inner part of the galaxy and c is the parameter that char- 
acterises any offset in the rotation axis of the galaxy. A model 
of the velocity map was created by re-projecting the best fitting 
arctan function. Figure [29] shows the original velocity map, the 



re-projected model and the residual map, in which the model 
is subtracted from the original map. The areas with noisy gas 
emission, excluded from the fitting analysis, have been masked 
from the figures. The residual map highlights the deviations from 
purely circular velocity. The gas velocity map of NGC5947 is 
well described by a purely rotating disc, assuming an asymptotic 
velocity (v^ -230 km s"^) and an inner regions slope (s -0.3 
arcsec"!). The residuals are in general lower than 30 km s"^ 
across the field. 

The conclusion from our exploratory kinematic analysis is 
that the global information that CALIFA will provide will allow 
us not only to classify galaxies through their kinematics, but also 
to build rotation curves and dispersion profiles (as shown in 
Fig. [28]), as well as detailed mass models for a large sample of 
galaxies of all morphological types and over a wide range of 
stellar mass. 



8. Conclusions 

The Calar Alto Legacy Integral Field Area survey (CALIFA sur- 
vey), has been designed to be the first survey to provide integral 
field spectroscopy data for a statistical sample of all galaxy types 
in the local Universe. On completion it will be the largest and the 
most comprehensive wide-field IFU survey of galaxies carried 
out to date. It will thus provide a valuable bridge between large 
single aperture surveys and more detailed studies of individual 
galaxies. 



In the first part of this paper we have presented the survey de- 
sign, the data reduction pipeline and a number of quality control 
procedures. We have reached the following conclusions: 

- At completion the survey will yield a statistically signifi- 
cant sample of -600 galaxies, which are representative of 
the galaxy population in the local Universe. 

- Already at this early stage in the survey we have a working 
data reduction pipeline, which operates without human inter- 
vention. We will be able to improve it regularly and provide 
consistently re-reduced complete datasets to the community. 

- To give the community a feeling for the quality of data that 
CALIFA will deliver, we have compared the CALIFA spec- 
tra to the more generally known SDSS spectra for equiva- 
lently sized central regions of those galaxies that are in com- 
mon between both datasets. From our preliminary analysis 
we find that the information content of the CALIFA spectra 
is at least equivalent to those of the SDSS (for a matched 
aperture). 

- We have presented the main quality parameters of the 
dataset: depth, spectrophotometric accuracy and stability, 
wavelength calibration and final spectral resolution. The 
analysis of the quality of the currently observed objects in- 
dicates that we reach the target depth for the survey in 75% 
of the datacubes and that the loss of depth in the remain- 
ing cases can be clearly attributed to adverse observing con- 
ditions. From all our quality checks we conclude that we 
are on track for providing high-quality, well-calibrated and 
well-characterized reduced data to the users of the CALIFA 
legacy survey. 

In the second part of the paper we have presented an ex- 
ploratory analysis of the first set of 21 galaxies observed through 
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Fig. 29. Results from the gas kinematics analysis. The left panel shows the Ha velocity map of NGC 5947 shown in Figure 27 
The central panel shows the best-fit model assuming a simple arctan-rotational curve. Finally the right panel shows the residua 
after subtracting the model to the velocity map. The spaxels not used in the fitting procedure have been masked. Contours indicate 
diff'erent velocity levels, with constant step of 30 km s"^ in the first two panels, and 10 km s"^ in the last one. 



2010 in order to verify the scientific usability of the data. We 
reached the following conclusions: 

- We have shown that the absorption line spectra yield in- 
formation on ages (and therefore mass-to-light ratios) and 
metallicities of the stellar population. We expect that with a 
more stringent analysis we will also be able to recover abun- 
dance ratios for the a elements. 

- We have shown that the CALIFA emission line spectra will 
yield dust attenuation, star formation rates, excitation mech- 
anisms and ionization parameters, metallicities and electron 
densities. 

- The conclusion from our exploratory kinematic analysis is 
that the global information from CALIFA will allow us to 
not only classify galaxies through their kinematics, but will 
also allow us to build rotation curves and dispersion profiles, 
as well as detailed mass models for a large sample of galaxies 
of all morphological types and over a wide range of stellar 
mass. 

- CALIFA will allow us to quantify the eff'ects of sampling dif- 
ferent physical aperture sizes. As an example we have shown 
that the fraction of galaxies classified as star forming (vs. 
AGN) is larger for integrated spectra than for central spectra. 
This is of course expected. CALIFA, however, will allow us 
to quantify these eff'ects. 

- Finally, we have shown how comparative analyses between 
diff'erent galaxy types and within individual galaxies can be 
performed using any of the derived parameters. CALIFA will 
thus be the first survey allowing comparative studies of the 
2D distribution of galaxy properties in a statistically mean- 
ingful way, including galaxies of many types. 

With two-dimensional maps of all these observables in hand, 
we expect that the data from the CALIFA survey will 1) charac- 
terize the local galaxy population in a way that will constitute a 
benchmark for models and other datasets alike and 2) will pro- 
duce qualitatively new results on many topics of current active 
research, such as the build-up of galaxy disks, the influence of 
environment, galaxy bi-modality (morphology, kinematics, stel- 
lar populations) and possible transition objects (green valley), 
comparative studies of AGN and non-AGN galaxies, kinematic 
galaxy classification, stellar vs. dark mass, the origin and evolu- 
tion of the warm ISM, to name but a few. 

CALIFA data will become public in regular data releases and 
we very much hope that the community will actively use the data 



for their own projects - only then will CALIFA have fulfilled its 
main goal, namely to be a legacy survey. 
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